alignment and matching thomas funkhouser and michael kazhdan princeton university thomas funkhouser...

41
Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University

Upload: magdalene-lawson

Post on 29-Dec-2015

222 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Alignment and MatchingAlignment and Matching

Thomas Funkhouser and Michael Kazhdan

Princeton University

Thomas Funkhouser and Michael Kazhdan

Princeton University

Page 2: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Challenge

The shape of model does not changewhen the model is translated, scaled,or rotated

Translation

Scale

Rotation

=

Page 3: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Outline

Matching• Alignment

Exhaustive Search Invariance Normalization

• Part vs. Whole Conclusion

Page 4: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Exhaustive Search

Search for the best aligning transformation:• Compare at all alignments• Match at the alignment for which models are

closest

Exhaustive search for optimal rotation

Page 5: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Exhaustive Search

Search for the best aligning transformation:• Compare at all alignments• Match at the alignment for which models are

closest

Page 6: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Exhaustive Search

Search for the best aligning transformation:• Use signal processing for efficient correlation• Represent model at many different

transformations

Properties:• Gives the correct answer• Is hard to do efficiently

Page 7: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Outline

Matching• Alignment

Exhaustive Search Invariance Normalization

• Part vs. Whole Conclusion

Page 8: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Invariance

Represent a model with information that is independent of the transformation:

• Extended Gaussian Image, Horn: Translation invariant• Shells Histograms, Ankerst: Rotation invariant• D2 Shape Distributions, Osada: Translation/Rotation

invariant

EGI Shells Histogram D2 Distribution

Page 9: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Invariance

Represent a model with information that is independent of the transformation• Power spectrum representation

Fourier Transform for translation and 2D rotations

Spherical Harmonic Transform for 3D rotations

Frequency

Ene

rgy

Frequency

Ene

rgy

Circular Power Spectrum Spherical Power Spectrum

Page 10: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Translation Invariance

1D Function

Page 11: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Translation Invariance

+ + += + …

Cosine/Sine Decomposition

1D Function

Page 12: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Translation Invariance

=

+ + +

Constant

= + …

Frequency Decomposition

1D Function

Page 13: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Translation Invariance

=

+ + +

+

Constant 1st Order

= + …+

Frequency Decomposition

1D Function

Page 14: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Translation Invariance

=

+ + +

+ +

Constant 1st Order 2nd Order

= + …+

Frequency Decomposition

1D Function

Page 15: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Translation Invariance

=

+ + +

+ + +

Constant 1st Order 2nd Order 3rd Order

= + …

+ …

+

Frequency Decomposition

1D Function

Page 16: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

+ + + + …+

Translation Invariance

= + + +

Constant 1st Order 2nd Order 3rd Order

+ …

Frequency Decomposition

=Amplitudes invariant

to translation1D Function

Page 17: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Rotation Invariance

Represent each spherical function as a sum of harmonic frequencies (orders)

+ += +

+ + +

Constant 1st Order 2nd Order 3rd Order

Page 18: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Rotation Invariance

Store “how much” (L2-norm) of the shape resides in each frequency to get a rotation invariant representation

+ + +

Constant 1st Order 2nd Order 3rd Order

=

Page 19: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Power Spectrum

Translation-invariance:• Represent the model in a Cartesian coordinate

system• Compute the 3D Fourier transform• Store the amplitudes of the frequency components

nml

znmylxinml efzyxf

,,

)(,,),,(

nmlnmlf ,,,,

Cartesian Coordinates

Translation Invariant Representation

zx

y

Page 20: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Power Spectrum

Single axis rotation-invariance:• Represent the model in a cylindrical coordinate

system• Compute the Fourier transform in the angular

direction• Store the amplitudes of the frequency components

k

kik ehrfhrf )(),(),,(

kk hrf ),(

Cylindrical Coordinates

Rotation Invariant Representation

h

r

Page 21: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Power Spectrum

Full rotation-invariance:• Represent the model in a spherical coordinate

system• Compute the spherical harmonic transform • Store the amplitudes of the frequency components

l lm

mlml Yrfrf ),()(),,( ,

llm

ml rf

2)(

Spherical Coordinates

Rotation Invariant Representation

r

Page 22: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Power Spectrum

Power spectrum representations• Are invariant to transformations• Give a lower bound for the best match• Tend to discard too much information

Translation invariant: n3 data -> n3/2 data Single-axis rotation invariant: n3 data -> n3/2

data Full rotation invariant: n3 data -> n2

data

Page 23: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Power Spectrum

Shells Histogram

Method Translation Rotation

EGI Constant Order

Crease Histograms Constant Order Spherical: Constant Order

D2 Square Spherical: Constant Order

Shells Spherical: Constant Order

Spherical Extent Cylindrical: Full

Harmonic Descriptor

Spherical: Full

Page 24: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Outline

Matching• Alignment

Exhaustive Search Invariance Normalization

• Part vs. Whole Conclusion

Page 25: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Normalization

Place a model into a canonical coordinate frame by normalizing for:• Translation• Scale• Rotation

Translation

Scale

Rotation

Page 26: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Normalization

Place a model into a canonical coordinate frame by normalizing for:• Translation: Center of mass• Scale• Rotation

Initial Models Translation-Aligned Models

[Horn et al., 1988]

Page 27: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Normalization

Place a model into a canonical coordinate frame by normalizing for:• Translation• Scale: Mean variance• Rotation

Translation-Aligned Models Translation- and Scale-Aligned Models

[Horn et al., 1988]

Page 28: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Normalization

Place a model into a canonical coordinate frame by normalizing for:• Translation• Scale• Rotation: PCA alignment

Translation- and Scale-Aligned Models Fully Aligned Models

PCA Alignment

Page 29: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Rotation

Properties:• Translation and rotation normalization is

guaranteed to give the best alignment• Rotation normalization is ambiguous

PCA Alignment

Directions of the axes are ambiguous

Page 30: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Normalization (PCA)

PCA defines a coordinate frame up to reflection in the coordinate axes.• Make descriptor invariant to axial-reflections

Reflections fix the cosine term Reflections multiply the sine term by -1

zx

y

k

kk kbkaf )sin()cos()(

kkk ba ,

Translation Invariant Representation

Page 31: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Retrieval Results (Rotation)

Method Floats

Exhaustive Search 8192

PCA + Flip Invariance

8192

PCA 8192

Cylindrical PS 4352

Spherical PS 512

Time:Method Secs.

Exhaustive Search 20.59

PCA + Flip Invariance

.67

PCA .67

Cylindrical PS .32

Spherical PS .03

Size:

0%

50%

100%

0% 50% 100%

Exhaustive SearchPCA + Flip InvarianceCylindrical Power SpectrumPCASpherical Power Spectrum

Recall

Pre

cisi

on

Gaussian EDT

Page 32: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Alignment

• Exhaustive search: Best results Inefficient to match

• Normalization: Provably optimal for translation and scale Works well for rotation if models have well defined

principal axes and the directional ambiguity is resolved

• Invariance: Compact Efficient Often less discriminating

Page 33: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Outline

Matching• Alignment

Exhaustive Search Invariance Normalization

• Part vs. Whole Conclusion

Page 34: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Partial Shape Matching

Cannot use global normalization methods that depend on whole model information: Center of mass for translation Mean variance for scale Principal axes for rotation

Normalized Whole

Normalized Part

(Mis-)Aligned Models

Page 35: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Partial Shape Matching

Cannot use global normalization methods that depend on whole model information:Exhaustively search for best alignmentNormalize using local shape informationUse transformation invariant representations

Normalized Whole

Normalized Part

(Mis-)Aligned Models

Page 36: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Spin Images & Shape Contexts

Translation (Exhaustive Search):

Represent each database model by many descriptors centered at different points on the surface.

Model

Multi-Centered Descriptors

Page 37: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Spin Images & Shape Contexts

Translation (Exhaustive Search):

To match, center at a random point on the query and compare against the different descriptors of the target

Query Part Randomly-Centered Descriptor

Target Descriptor

Best Match

Page 38: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Spin Images & Shape Contexts

Rotation (Normalization):

For each center, represent in cylindrical coordinates about the normal vector• [Spin Images]: Store energy in each ring

• [Harmonic Shape Contexts]: Store power spectrum of each ring

• [3D Shape Contexts]: Search over all rotations about the normal for best match

n

nn

Page 39: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Spin Images & Shape Contexts

Spin images and shape contexts allow for part-in-whole searches by exhaustively searching for translation and using the normal for rotation alignment• [Spin Images]: Store energy in each ring

• [Harmonic Shape Contexts]: Store power spectrum of each ring

• [3D Shape Contexts]: Search over all rotations about the normal for best match

Image courtesy of Frome et al, 2003

Page 40: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Conclusion

Aligning Models:• Exhaustive Search• Normalization• Invariance

Partial Object Matching• Can’t use global normalization techniques

Translation: Exhaustive Search Rotation: Normal + Exhaustive/Invariant

Page 41: Alignment and Matching Thomas Funkhouser and Michael Kazhdan Princeton University Thomas Funkhouser and Michael Kazhdan Princeton University

Conclusion

Shape Descriptors and Model Matching:• Decoupling representation from registration

Can design and evaluate descriptors without having to solve the alignment problem

Can develop methods for alignment without considering specific shape descriptors