sublinear algorihms for big data

Sublinear Algorihms for Big Data

Grigory Yaroslavtsevhttp://grigory.us

Lecture 4

Slides are available at http://grigory.us/big-data.html

• Dimensionality reduction– AMS as dimensionality reduction– Johnson-Lindenstrauss transform

• Sublinear time algorithms– Definitions: approximation, property testing– Basic examples: approximating diameter, testing

properties of images– Testing sortedness– Testing connectedness

-norm Estimation

• Stream: updates that define vector where . • Example: For

• -norm:

-norm Estimation

• -norm: • Two lectures ago:– -moment – -moment (via AMS sketching)

• Space: • Technique: linear sketches– for random set s – for random signs

AMS as dimensionality reduction

• Maintain a “linear sketch” vector

, where • Estimator for :

• “Dimensionality reduction”: “heavy” tail

Normal Distribution

• Normal distribution – Range: – Density: – Mean = 0, Variance = 1

• Basic facts:– If and are independent r.v. with normal

distribution then has normal distribution

– If are independent, then

Johnson-Lindenstrauss Transform

• Instead of let be i.i.d. random variables from normal distribution

• We still have because:– “variance of “

• Define and define:

• JL Lemma: There exists s.t. for small enough

Proof of JL Lemma

• JL Lemma: s.t. for small enough

• Assume . • We have and

• Alternative form of JL Lemma:

Proof of JL Lemma

• Let and • For every we have:

• By Markov and independence of :

• We have , hence:

Proof of JL Lemma

• For every we have:

• Let and recall that • A calculation finishes the proof:

Johnson-Lindenstrauss Transform

• Single vector: – Tight: [Woodruff’10]

• vectors simultaneously: – Tight: [Molinaro, Woodruff, Y. ’13]

• Distances between vectors = vectors:

sublinear algorihms for big data

Documents

what can we do in sublinear time? 0368.4612 seminar on...

sublinear algorithms

asymmetric lsh (alsh) for sublinear time maximum inner...

sublinear time algorithms - people | mit csailronitt...

an introduction to chaining, and applications to sublinear

simpleflow: a non-iterative, sublinear optical flow...

sublinear expectations and martingales in discrete...

distributed learning with sublinear...

asymmetric lsh (alsh) for sublinear time maximum inner...

sublinear algorithms course...sublinear-time algorithms for...

sublinear algorithms lecture 23: april 20 10312013 …

secure computation with sublinear amortized...

limits of practical sublinear secure computation · limits...

networks cannot compute their diameter in sublinear time...

homomorphic signatures with sublinear public keys via...

ieee transactions on big data, vol. 2, no. 3...

linear-time arguments with sublinear verification from

sublinear-time algorithms - university of...

learning determinantal point processes in sublinear time

lowerboundsandhardnessmagniﬁcationfor sublinear