identification of prototypes for handwritten digits using ...nitish/iitk_page/se367/pres2.pdfnitish...

Post on 01-Aug-2020

9 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Identification of Prototypes for HandwrittenDigits using Manifold Learning

Nitish Srivastava Pradeep Karuturi

SE 367: Introduction to Cognitive ScienceDepartment of Computer Science and Engineering, IIT Kanpur

November 2, 2009

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Introduction

I Sensory perception receieved by humans is extremely highdimensional in terms of attributes.

I One possible way in which the brain could extract meaningfulinformation from this multitude of data is by mapping objectsto categories.

I This mapping is governed by a regularity in the features ofobjects belonging to the same category.

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Aim

I We aim to find the regularity that can be used to map allinstances of a digit to the same category and also find out thecentral instances of that category.

I Tangent distance as a distance metric between digit images.

I ISOMAP algorithm to generate a low dimensional embedding.

I find the manifolds for the entire dataset and also for eachdigit separately.

I find the prototypical image of each digit using the manifoldfor that digit

I validate the results of prototype selection from thevisualization of the manifold.

I identify peripheral images in the visualization and validatethat these are indeed far from prototypical.

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Tangent distance

I Tangent distance [1] is invariant to certain transformation inthe images, such as translation, rotation, scaling, shearing,squeezing etc.

I The tangent distance computes the distance shortest distancebetween two manifolds generated by the two correspondinginput images.

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Tangent distance

The manifolds are approximated as tangents to the manifold at theoriginal image point. The distance between the two images is thesmallest distance between the two tangents.

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Tangent distance example

Figure: Adapted from [1]. Left: Original image. Middle: 5 tangentvectors corresponding to: scaling, rotation, expansion of the X axis whilecompressing the Y axis, expansion of the first diagonal while compressingthe second diagonal and thickening. Right: 32 points in the tangentspace generated by adding or subtracting each of the 5 tangent vectors.

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

ISOMAP

I ISOMAP estimates the intrinsic geometry of a data manifoldbased on a rough estimate of each data points neighbors onthe manifold.

I Uses geodesic distance(i.e., shortest path on the kNN-graph)as the distance measure between two datapoints.

I It then embedds the pair-wise distances into alow-dimensional space using Multi-dimensional scaling(MDS).

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘4’ and ‘9’ using tangent distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘4’ and ‘9’ using tangent distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘4’ and ‘9’ using tangent distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘4’ and ‘9’ using L2 distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘1’ , ‘2’ and ‘7’ using tangent distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘1’ , ‘2’ and ‘7’ using tangent distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Separation of ‘1’ , ‘2’ and ‘7’ using L2 distance

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘2’

Similar to that obtained in [2]Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘2’

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘1’

Similar to that obtained in [2]Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘1’

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘4’

Similar to that obtained in [2]Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘4’

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘7’

Similar to that obtained in [2]Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Embedding of ‘7’

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

Bibliography

Patrice Y. Simard, Yann A. Le Cun, John S. Denker, andBernard Victorri.Transformation invariance in pattern recognition - tangentdistance and tangent propagation.In Lecture Notes in Computer Science, pages 239–274.Springer, 1998.

J. B. Tenenbaum, V. de Silva, and J. C. Langford.A global geometric framework for nonlinear dimensionalityreduction.Science, 290(5500):2319–2323, December 2000.

Nitish Srivastava, Pradeep Karuturi Identification of Prototypes for Handwritten Digits using Manifold Learning

top related