image masking schemes for local manifold learning...

22
Marco F. Duarte Image Masking Schemes for Local Manifold Learning Methods Joint work with Hamid Dadkhahi IEEE ICASSP - April 23 2015

Upload: others

Post on 07-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Marco F. Duarte

Image Masking Schemes for Local Manifold Learning Methods

Joint work with Hamid Dadkhahi IEEE ICASSP - April 23 2015

Page 2: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Manifold Learning

• Given training points in , learn the mapping �to the underlying K-dimensional articulation manifold

• Exploit local geometry to capture parameter differences by embedding distances

• ISOMAP, LLE, HLLE, …

• Ex:�images of �� rotating teapot��articulation space�� = circle

Page 3: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Compressive Manifold Learning

• Isomap algorithm approximates geodesic distances using distances between neighboring points

• Random measurements preserve these distances

• Theorem: If , then the Isomap residual variance in the projected domain is bounded by the additive � error factor

N = 4096 (Full Data) M = 100 M = 50 M = 25

translating disk manifold

(K=2)

[Hegde, Wakin,Baraniuk 2008]

• Given training points in , learn the mapping �to the underlying K-dimensional articulation manifold

Page 4: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Custom Projection Operators

•Goal of Dimensionality Reduction: To preserve distances between points in the manifold, i.e., for

•Collect pairwise differences into set of secant vectors

•Search for projection that preserves norms of secants:

[Hegde, Sankaranarayanan, Baraniuk 2012]

Page 5: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

•Usual approach: Principal Component Analysis (PCA) •Collect all secants into a matrix:

• Perform eigenvalue decomposition on S:

•Select top eigenvectors as projections

• PCA minimizes the average squared distortion over secants, but can distort individual secants arbitrarily and therefore warp manifold structure

Custom Projection Operators

[Hegde, Sankaranarayanan, Baraniuk 2012]

Page 6: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

• For target distortion , find matrix featuring the smallest number of rows that yields

• This is equivalent to minimizing the rankof the matrix such that

•Use nuclear norm as proxy for rank to obtain computationally efficient approach • Improves over random projections since matrix is

specifically tailored to manifold observed •May be difficult to link target distortion to matrix rank/

number of rows

Custom Projection Operators: NuMax

[Hegde, Sankaranarayanan, Baraniuk 2012]

Page 7: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

• Projections matrices have entries with arbitrary values • Physics of sensing process,

hardware devices restrict types of projections we can obtain • Example: Low-power imaging

for computational eyeglasses • Low-power imaging sensor

allows for individual selection of pixels to record • Power consumption proportional

to number of pixels sampled •Random projections/NuMax involve

half/all pixels and do not enable power savings •How to derive constrained

projection matrices that involve only few pixels?

Issues with Randomness and NuMax

[Mayberry, Hu, Marlin, Salthouse, Ganesan 2014]

Page 8: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

•Select only a subset of the pixels of size M that minimizes distortion to manifold structure

• Emulate strategies for projection design into mask design

•Random Masking: Pick M pixels uniformly at random across image

Masking Strategies for Manifold Data

M = 100 pixels

Page 9: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

•Select only a subset of the pixels of size M that minimizes distortion to manifold structure

• Emulate strategies for projection design into mask design

•Principal coordinate analysis: Pick M coordinates that maximize variance among secants

Masking Strategies for Manifold Data

M = 100 pixels

[Dadkhahi and Duarte 2014]

Page 10: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

•Select only a subset of the pixels of size M that minimizes distortion to manifold structure • Emulate strategies for projection

design into mask design

•Adaptation of NuMax: •Define secants from k-nearest

neighbor graph:

• Pick masking matrix (row submatrix of I) to minimize secant norm distortion after scaling:

•Combinatorial integer program replaced by greedy approximation

Masking Strategies for Manifold Data

M = 100 pixels

[Dadkhahi and Duarte 2014]

Page 11: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Isomap vs. Locally Linear Embedding

•While Isomap employs distances between neighbors when designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, representing each vector as a weighted linear combination of its neighbors

• In particular, Isomap embedding is sensitive to scaling of the point clouds, while LLE isn’t

Page 12: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Isomap vs. Locally Linear Embedding

•While Isomap employs distances between neighbors when designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, representing each vector as a weighted linear combination of its neighbors

• In particular, Isomap embedding is sensitive to scaling of the point clouds, while LLE isn’t

Page 13: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Isomap vs. Locally Linear Embedding

•While Isomap employs distances between neighbors when designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, representing each vector as a weighted linear combination of its neighbors

• In particular, Isomap embedding is sensitive to scaling of the point clouds, while LLE isn’t

• To preserve this additional local information, we expand the set of secants to include distances between neighbors of each point

Page 14: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

•While Isomap employs distances between neighbors when designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, representing each vector as a weighted linear combination of its neighbors

• In particular, Isomap embedding is sensitive to scaling of the point clouds, while LLE isn’t

• To preserve this additional local information, we expand the set of secants to include distances between neighbors of each point

Isomap vs. Locally Linear Embedding

Page 15: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

• Expand the set of secants considered:

•Compute squared norms of secants in obtained from the original images and from masked images; collect into “norm” vectors

•Choose mask that maximizes sum of cosine similarities between original and masked “norm” vectors:

•Replace combinatorial optimization by greedy forward selection algorithm •Cosine similarity is invariant to (local) scaling of point cloud

Manifold-Aware Pixel Selection for LLE

Page 16: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Manifold-Aware Pixel Selection for LLE

M = 100 pixelsM = 100 pixels

Page 17: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

50 100 150 200 250 300

2

4

6

8

10

Eyeglasses

Embedding Dim./Masking Size m

Em

beddin

g E

rror

Full DataPCASPCAPCoAMAPS−LLEMAPS−IsomapRandom

Performance Analysis: LLE Embedding Error

M50 100 150 200 250 300

2

4

6

8

10

Eyeglasses

Embedding Dim./Masking Size m

Em

beddin

g E

rror

Full DataPCASPCAPCoAMAPS−LLEMAPS−IsomapRandom

Page 18: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

50 100 150 200 250 300

500

1000

1500

2000

2500

MNIST

Embedding Dim./Masking Size m

Em

be

dd

ing

Err

or

M

Performance Analysis: LLE Embedding Error

50 100 150 200 250 300

2

4

6

8

10

Eyeglasses

Embedding Dim./Masking Size m

Em

beddin

g E

rror

Full DataPCASPCAPCoAMAPS−LLEMAPS−IsomapRandom

Page 19: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Performance Analysis: LLE Embedding Error

50 100 150 200 250 300

10

20

30

40

50

Heads

Embedding Dim./Masking Size m

Em

be

dd

ing

Err

or

M50 100 150 200 250 300

2

4

6

8

10

Eyeglasses

Embedding Dim./Masking Size m

Em

beddin

g E

rror

Full DataPCASPCAPCoAMAPS−LLEMAPS−IsomapRandom

50 100 150 200 250 300

2

4

6

8

10

Eyeglasses

Embedding Dim./Masking Size m

Em

beddin

g E

rror

Full DataPCASPCAPCoAMAPS−LLEMAPS−IsomapRandom

Page 20: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Performance Analysis: 2-D LLE

M = 100 pixels

Page 21: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

50 100 150 200 250 300

25

30

35

40

45

Embedding Dim./Masking Size m

Ave

rag

e G

aze

Est

ima

tion

Err

or

Full DataPCASPCAPCoAMAPS−LLEMAPS−IsomapRandom

Computational Eyeglasses: Eye Gaze Tracking

M

Error measuredin “target” pixels

Page 22: Image Masking Schemes for Local Manifold Learning Methodsmduarte/images/IMSLMLM-ICASSP2015-Slides.pdf• New sensors enable CS by masking images, i.e., restricting the type of projections

Conclusions• Compressive sensing (CS) for manifold-modeled images

via random or customized projections (NuMax) • New sensors enable CS by masking images,

i.e., restricting the type of projections • Our MAPS algorithms find image masks that best

preserve geometric structure used during manifold learning for image datasets

• Greedy algorithms provide good preservation of learned manifold embeddings, suitable for parameter estimation

• While Isomap relies on distances between neighbors, LLE also leverages local geometric structure; different algorithms are optimal for these cases

• Concept of subsampling as feature selection - supervised and unsupervised learning?

http://www.ecs.umass.edu/~mduarte [email protected]