robust pca - nus computingleowwk/cs6101/ay2012-13 sem... · 2012. 9. 18. · rpca reconstructed...

Robust PCA

Yingjun Wu

Preliminary: vector projection

Scalar projection of a onto b:

a1 could be expressed as:

b=(10,4)

w=(1,0)

Example

Presenter

Presentation Notes

Norm a multiplied by cos theta a multiplied by b hat b hat means unit vector in the direction of b.

Preliminary: understanding PCA

Preliminary: methodology in PCA

• Purpose: project a high-dimensional object onto a low-dimensional subspace.

• How-to: – Minimize distance;

– Maximize variation.

Presenter

Presentation Notes

Essence of project: reduce redundant information. Feature to preserve: features that are most unique when compared with others. Suppose build a software to differentiate men/women. What feature you would like to use? Figure. Face.

Preliminary: math in PCA

• Minimize distance

Energy function

Compress

Recover

Preliminary: PCA example

• Original figure

RGB2GRAY

• Do something tricky:

compress decompress

• Do something tricky:

compress decompress

Feature#=1900

Feature#=500

Feature#=10 Feature#=50

Presenter

Presentation Notes

Featuer#=# of principal components

Preliminary: problem in PCA

PCA fails to account for outliers.

Reason: use least squares estimation.

robust PCA

One version of robust PCA: L. Xu et.al’s work.

Mean idea: regard entire data samples as outliers.

Samples are rejected!

robust PCA

Xu’s work modified the energy function slightly and penalty is added.

If Vi=1 the sample di is taken into consideration, otherwise it is equivalent to discard the sample.

Penalty item

robust PCA

Another version of robust PCA: Gabriel et.al’s work, or called weighted SVD.

Mean idea: do not regard entire sample as outlier. Assign weight to each feature in each sample. Outlier features could be assigned with less weight.

robust PCA

Weighted SVD also modified the energy function slightly.

Original feature Decompressed feature Weight

robust PCA

Flaw of Gabriel’s work: cannot scale to very high dimensional data such as images.

Flaw of Xu’s work: useful information in flawed samples is ignored; least squares projection cannot overcome the problem of outlier.

robust PCA

To handle the problem in the two methods, a new version of robust PCA is proposed.

Still try to modify the energy function of PCA…

Penalty

Distance Scale of error

Outlier process Xu’s work

Presenter

Presentation Notes

Sigma controls the convexity of the robust function and is used for deterministic annealing in the optimization process

robust PCA

To handle the problem in the two methods, a new version of robust PCA is proposed.

Still try to modify the energy function of PCA…

Increase without bound! Error rejected!

Presenter

Presentation Notes

X denotes how far does the error from the normal value, y denotes distance Normal method uses least-squared distance so that mistakes resulted from large error is out of bound. Using outlier process, error is rejected.

Experiments Four faces, the second face is contaminated.

Learned basis images.

PCA Xu RPCA

Reconstructed faces.

Presenter

Presentation Notes

The basis images could be used to reconstruct the original images. Original PCA do not reject outliers, so the learned basis is not accurate. Xu’s work reject the second sample entirely, so the second face could not be reconstructed.

Experiments

Original video

RPCA

PCA

Recent works

• John Wright et.al proposed a new version of RPCA.

• Problem: assume a matrix A is corrupted by error or noise, if we observed D, how to recover A?

Observed matrix

Linear operator

Original matrix error

Recent works

Presenter

Presentation Notes

The optimization problem is intractable and of little practical use. Solve it using a convex approximation method

Recent works

Presenter

Presentation Notes

Identiy operator, or linear operator. One pixel maps to a single pixel. ||E||_0 is the counting norm (the number of non-zero entries in the matrix) Using convex relaxation instead.

Robust PCA demo

Presenter

Presentation Notes

Identiy operator, or linear operator. One pixel maps to a single pixel. ||E||_0 is the counting norm (the number of non-zero entries in the matrix) Using convex relaxation instead.

References

• De la Torre, F. et.al, Robust principal component analysis for computer vision, ICCV 2001

• M. Black et.al, On the unification of line processes, outlier rejection, and robust statistics with applications in early vision, IJCV 1996

• D. Geiger et.al, The outlier process, IEEE workshop on NNSP, 1991

• L. Xu et.al, Robust principal component analysis by self-organizing rules based on statistical physics approach, IEEE trans. Neural Networks, 1995

• John Wright et. al, Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Matrices by Convex Optimization, NIPS 2009

• Emmanuel Candes et.al, Robust Principal Component Analysis?, Journal of ACM, 2011

Thank you!

robust pca - nus computingleowwk/cs6101/ay2012-13 sem... · 2012. 9. 18. · rpca reconstructed...

Documents

liz sim - suny downstate medical center · chest tube and...

q2 2019 results: webcast presentation...2019/08/01 ·...

powerpoint presentation...study area jointly developed by...

爀䴀椀氀椀琀愀爀礀欀渀椀瘀攀猀愀爀攀...

powerpoint presentation - umkc wordpress...we use moodle...

mid-america regional council (marc)...5 transit agencies \...

legal analysis and writing...legal analysis and writing...

爀伀甀爀洀椀猀猀椀漀渀椀猀...

time for a...

powerpoint presentation...these may incl\ de the...

amc powerpoint presentation template - sae.org · captured...

branding and identitybranding and identity jenn jones, cae,...

getting savvy at sasi - amazon web services...online:\爀...

stakeholder meeting - wake county, north carolina...

acrobat x and pdf/a standardsmark\爀吀栀愀渀欀猀...

règlementation, intensification et changement climatique...

爀䐀攀猀瀀椀琀攀琀栀攀...

revised total coliform ule - mwua...topics to be covered...

powerpoint presentation ppt...\爀吀栀椀猀椀猀...

high functioning teams in primary care · group-team...