overview - math.nyu.educfgranda/pages/obda_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) the dark...

81
Overview Optimization-Based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/OBDA_spring16 Carlos Fernandez-Granda 1/25/2016

Upload: others

Post on 09-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Overview

Optimization-Based Data Analysishttp://www.cims.nyu.edu/~cfgranda/pages/OBDA_spring16

Carlos Fernandez-Granda

1/25/2016

Page 2: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 3: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 4: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising

Additive noise model

data = signal + noise

Page 5: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Hard thresholding

Hη (x)i :=

xi if |xi | > η,

0 otherwise

Page 6: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding

Data

Signal

Page 7: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding

Estimate

Signal

Page 8: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparsifying transforms

x = Dc =n∑

i=1

Di ci ,

Page 9: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Discrete cosine transform

Signal DCT coefficients

Page 10: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Wavelets

Page 11: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sorted wavelet coefficients

10−3

10−1

101

103

Page 12: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding in a basis

x = Bc

y = Bc + z

c = Hη(B−1y

)y = Bc

Page 13: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding in a DCT basis

DCT coefficients

Data

Signal

Data

Page 14: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding in a DCT basis

DCT coefficients

Estimate

Signal

Estimate

Page 15: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding in a wavelet basis

Original Noisy Estimate

Page 16: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding in a wavelet basis

Original Noisy Estimate

Page 17: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via thresholding in a wavelet basis

Original Noisy Estimate

Page 18: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Overcomplete dictionary

DCT coefficients

Page 19: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Overcomplete dictionary

x = Dc =[A B

] [ab

]= Aa + Bb

Page 20: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparsity estimation

First idea:

minc∈Rm

||c ||0 such that x = Dc

Computationally intractable!

Tractable alternative:

minc∈Rm

||c ||1 such that x = Dc

Page 21: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Geometric intuition

c1

c2Min. `1-norm solution

Min. `2-norm solution

Dc = x

Page 22: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparsity estimation

DCT subdictionarySpike subdictionary

Page 23: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Minimum `2-norm coefficients

DCT subdictionarySpike subdictionary

Page 24: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Minimum `1-norm coefficients

DCT subdictionarySpike subdictionary

Page 25: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via `1-norm regularized least squares

c = arg minc∈Rm

||x − Dc ||22 + λ ||c ||1

x = Dc

Page 26: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via `1-norm regularized least squares

Page 27: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via `1-norm regularized least squares

SignalEstimate

Page 28: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via `1-norm regularized least squares

SignalEstimate

Page 29: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Learning the dictionary

minC∈Rm×k

∣∣∣∣∣∣X − DC∣∣∣∣∣∣2

F+ λ ||c ||1 such that

∣∣∣∣∣∣Di

∣∣∣∣∣∣2

= 1, 1 ≤ i ≤ m

Page 30: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Dictionary learning

Page 31: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via dictionary learning

Page 32: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via dictionary learning

Page 33: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Denoising via dictionary learning

Page 34: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 35: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Linear regression

yi ≈p∑

j=1

θjXij , 1 ≤ i ≤ n

y ≈ Xθ

Page 36: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparse regression

Least squares

θls := arg minθ∈Rn

∣∣∣∣∣∣y − X θ∣∣∣∣∣∣

2

Ridge regression (aka Tikhonov regularization)

θridge := arg minθ∈Rn

∣∣∣∣∣∣y − X θ∣∣∣∣∣∣2

2+ λ

∣∣∣∣∣∣θ∣∣∣∣∣∣22

The lasso

θlasso := arg minθ∈Rn

∣∣∣∣∣∣y − X θ∣∣∣∣∣∣2

2+ λ

∣∣∣∣∣∣θ∣∣∣∣∣∣1

Page 37: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Ridge-regression coefficients

10-4 10-3 10-2 10-1 100 101 102 103 104 105

Regularization parameter

0.5

0.0

0.5

1.0

1.5

2.0C

oeff

icie

nts

Relevant features

Page 38: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Lasso coefficients

10-4 10-3 10-2 10-1 100 101 102 103 104 105

Regularization parameter

0.5

0.0

0.5

1.0

1.5

2.0C

oeff

icie

nts

Relevant features

Page 39: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Results

10-4 10-3 10-2 10-1 100 101 102 103 104 105

Regularization parameter

0.2

0.4

0.6

0.8

1.0

Rela

tive e

rror

(l2

norm

)

Least squares (training)

Least squares (test)

Ridge (training)

Ridge (test)

Lasso (training)

Lasso (test)

Page 40: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 41: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Limits of resolution in imaging

The resolving power of lenses, however perfect, is limited (Lord Rayleigh)

Diffraction imposes a fundamental limit on the resolution of optical systems

Page 42: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Fluorescence microscopy

Data

Point sources Low-pass blur

(Figures courtesy of V. Morgenshtern)

Page 43: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sensing model for super-resolution

Point sourcesPoint-spreadfunction

Data

∗ =

Spectrum × =

Page 44: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Minimum `1-norm estimate

minimize ||estimate||1subject to estimate ∗ psf = data

Point sources Estimate

Page 45: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Magnetic resonance imaging

Page 46: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Magnetic resonance imaging

I Data: Samples from spectrumI Problem: Sampling is time consuming (patients get annoyed,

kids move during data acquisition)I Images are compressible (≈ sparse)I Can we recover compressible signals from less data?

Page 47: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Compressed sensing

1. Undersample the spectrum randomly

Signal Spectrum

Data

Page 48: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Compressed sensing

2. Solve the optimization problem

minimize ||estimate||1subject to frequency samples of estimate = data

Signal Estimate

Page 49: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Compressed sensing in MRI

OriginalMin. `2-normestimate

Min. `1-normestimate

Page 50: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 51: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 52: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Netflix Prize

? ? ? ?

?

?

??

??

???

?

?

Page 53: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Collaborative filtering

A :=

Bob Molly Mary Larry

1 1 5 4 The Dark Knight2 1 4 5 Spiderman 34 5 2 1 Love Actually5 4 2 1 Bridget Jones’s Diary4 5 1 2 Pretty Woman1 2 5 5 Superman 2

Page 54: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Centering

µ :=1n

m∑i=1

n∑j=1

Aij ,

A :=

µ µ · · · µµ µ · · · µ· · · · · · · · · · · ·µ µ · · · µ

Page 55: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SVD

A− A = USV T = U

7.79 0 0 00 1.62 0 00 0 1.55 00 0 0 0.62

V T

Page 56: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

First left singular vector

U1 =D. Knight Sp. 3 Love Act. B.J.’s Diary P. Woman Sup. 2

( )−0.45 −0.39 0.39 0.39 0.39 −0.45

Page 57: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

First right singular vector

V1 =Bob Molly Mary Larry

( )0.48 0.52 −0.48 −0.52

Page 58: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Rank 1 model

A + σ1U1V T1 =

Bob Molly Mary Larry

1.34 (1) 1.19 (1) 4.66 (5) 4.81 (4) The Dark Knight1.55 (2) 1.42 (1) 4.45 (4) 4.58 (5) Spiderman 34.45 (4) 4.58 (5) 1.55 (2) 1.42 (1) Love Actually4.43 (5) 4.56 (4) 1.57 (2) 1.44 (1) B.J.’s Diary4.43 (4) 4.56 (5) 1.57 (1) 1.44 (2) Pretty Woman1.34 (1) 1.19 (2) 4.66 (5) 4.81 (5) Superman 2

Page 59: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Matrix completion

Bob Molly Mary Larry

1 ? 5 4 The Dark Knight? 1 4 5 Spiderman 34 5 2 ? Love Actually5 4 2 1 Bridget Jones’s Diary4 5 1 2 Pretty Woman1 2 ? 5 Superman 2

Page 60: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Low-rank matrix estimation

First idea:

minX∈Rm×n

rank(X)

such that XΩ ≈ y

Computationally intractable because of missing entries

Tractable alternative:

minX∈Rm×n

∣∣∣∣∣∣XΩ − y∣∣∣∣∣∣2

2+ λ

∣∣∣∣∣∣X ∣∣∣∣∣∣∗

Page 61: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Matrix completion via nuclear-norm minimization

Bob Molly Mary Larry

1 2 (1) 5 4 The Dark Knight

2 (2) 1 4 5 Spiderman 34 5 2 2 (1) Love Actually5 4 2 1 Bridget Jones’s Diary4 5 1 2 Pretty Woman1 2 5 (5) 5 Superman 2

Page 62: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 63: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Background subtraction

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 64: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Low rank + sparse model

minL,S∈Rm×n

∣∣∣∣∣∣L∣∣∣∣∣∣∗

+ λ∣∣∣∣∣∣S∣∣∣∣∣∣

1such that L + S = Y

Page 65: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Frame 17

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 66: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Low-rank component

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 67: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparse component

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 68: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Frame 42

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 69: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Low-rank component

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 70: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparse component

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 71: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Frame 75

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 72: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Low-rank component

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 73: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Sparse component

20 40 60 80 100 120 140 160

20

40

60

80

100

120

Page 74: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SparsityDenoisingRegressionInverse problems

Low-rank modelsMatrix completionLow rank + sparse modelNonnegative matrix factorization

Page 75: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Topic modeling

A :=

singer GDP senate election vote stock bass market band Articles

6 1 1 0 0 1 9 0 8 a1 0 9 5 8 1 0 1 0 b8 1 0 1 0 0 9 1 7 c0 7 1 0 0 9 1 7 0 d0 5 6 7 5 6 0 7 2 e1 0 8 5 9 2 0 0 1 f

Page 76: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

SVD

A− A = USV T = U

19.32 0 0 00 14.46 0 0 0 00 0 4.99 0 0 00 0 0 2.77 0 00 0 0 0 1.67 00 0 0 0 0 0.93

V T

Page 77: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Left singular vectors

a b c d e f( )U1 = −0.51 −0.40 −0.54 −0.11 −0.38 −0.38( )U2 = 0.19 −0.45 −0.19 −0.69 −0.2 −0.46( )U3 = 0.14 −0.27 −0.09 −0.58 −0.69 −0.29

Page 78: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Right singular vectors

singer GDP senate election vote stock bass market band

( )V1 = −0.38 0.05 0.40 0.27 0.40 0.17 −0.52 0.14 −0.38( )V2 = 0.16 −0.46 0.33 0.15 0.38 −0.49 0.10 −0.47 0.12( )V3 = −0.18 −0.18 −0.04 −0.74 −0.05 0.11 −0.10 −0.43 −0.43

Page 79: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Nonnegative matrix factorization

M ≈WH, Wi ,j ≥ 0, 1 ≤ i ≤ m, 1 ≤ j ≤ r ,Hi ,j ≥ 0, 1 ≤ i ≤ r , 1 ≤ i ≤ n,

Page 80: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Right nonnegative factors

singer GDP senate election vote stock bass market band

( )H1 = 0.34 0 3.73 2.54 3.67 0.52 0 0.35 0.35( )H2 = 0 2.21 0.21 0.45 0 2.64 0.21 2.43 0.22( )H3 = 3.22 0.37 0.19 0.2 0 0.12 4.13 0.13 3.43

Page 81: Overview - math.nyu.educfgranda/pages/OBDA_spring16/... · 1:34(1) 1:19(1) 4:66(5) 4:81(4) The Dark Knight 1:55(2) 1:42(1) 4:45(4) 4:58(5) Spiderman 3 4 :45 ( )58 5 155 2 42 Love

Left nonnegative factors

a b c d e f( )W1 = 0.03 2.23 0 0 1.59 2.24( )W2 = 0.1 0 0.08 3.13 2.32 0( )W3 = 2.13 0 2.22 0 0 0.03