text extraction from digital image

23
Prepared By: Amit Bhoraniya (7022) Kaushik Godhani(7009) Mayur Halai(7016) Vikram Ghunsar(7039) Text Extraction From Image Guided By: Mr. Udesang Jaliya Mr. Kirti Sharma

Upload: kaushik-godhani

Post on 07-Nov-2014

596 views

Category:

Education


5 download

DESCRIPTION

Text Extraction is a process by which we convert Printed document/Scanned Page or Image in which text are available to ASCII Character that a Computer can Recognize.

TRANSCRIPT

Page 1: Text extraction From Digital image

Prepared By:Amit Bhoraniya (7022)

Kaushik Godhani(7009)Mayur Halai(7016)

Vikram Ghunsar(7039)

Text Extraction From Image

Guided By:Mr. Udesang Jaliya

Mr. Kirti Sharma

Page 2: Text extraction From Digital image

What is Text Extraction ??Text Extraction is a process by which

we convert Printed document/Scanned Page or Image in which text are available to ASCII Character that a Computer can Recognize.

Page 3: Text extraction From Digital image

Goal Of Project

GENERAL APTITUDEComputer ScienceElectronics & Communication Engineering

Page 4: Text extraction From Digital image

How Will We Archive That Goal ??

1Preprocessing

2Segmentation

3Recognition

Page 5: Text extraction From Digital image

Pre-Processing1

Page 6: Text extraction From Digital image

Pre-Processing

1Gray Scale 2Noise Removal 3Thresholding

Page 7: Text extraction From Digital image

Gray Scale

Page 8: Text extraction From Digital image

Noise Removal

Noise Removal is used to Enhance the ImageFor Enhancing We have used Median Filter

FilteredImage = Median Filter(Origional Image, FilterSize)We have used FilterSize [5,5]

Page 9: Text extraction From Digital image

Thresholding

Edge DetectionDilate ImageDetect Text Area Using HistrogramPersonal Thresholding to Text Area

Page 10: Text extraction From Digital image

Edge Detection using Canny

Page 11: Text extraction From Digital image

Dilate

Page 12: Text extraction From Digital image

Text Area Using Histrogram

Page 13: Text extraction From Digital image

Algorithm

• Row Histrogram• Separate Region by (no. of Pixel > 60 )• For Each Row

– Separate Region by (no. of Pixel > Height of (Row/4))

Page 14: Text extraction From Digital image

2 Segmentation

Page 15: Text extraction From Digital image

Segmentation

1Line Segmentation 2Word

Segmentation

3Character Segmentation

Page 16: Text extraction From Digital image

From above Image, Image are segment in to Different Lines, Below an example of Only For one Line.

TEXT SEGMENTATION

Page 17: Text extraction From Digital image

Find all the word than convert text area in one image

Segmentation

Character are separate from the word

Page 18: Text extraction From Digital image

3 Recognition

Page 19: Text extraction From Digital image

Recognization

1Feature Extraction 2Classifier

3Text Document

Page 20: Text extraction From Digital image

• Feature Extraction• Binary Code Method• Chain Code Method• PCA (Principle Component Analysis)• LDA (Linear Discriminative Image)

• Classifier• Artificial Neural Network• Support Vector Machine

Recognization

Page 21: Text extraction From Digital image

Applications• Banking (To read Credit Card)• Libraries (To convert Scanned Page to

Image)• Govt. Sector (Form Processing)• Used in Car Number Plate Recognition

System• Undesirable Text removal from images.

Page 22: Text extraction From Digital image

References

1. OCR for Devnagari Script by Mahesh Goyani2. Edge Based Text Extraction From Complex Images

by Xiaoqing Liu and Jagath Samarbandhu3. Automatic Text Detection using Morphological

Operations and Inpainting by Khyati Vaghela4. Font and Background Color Independent Text

Binarization by T.Kasar , J.Kumar , A.G. Ramkrishnan

Page 23: Text extraction From Digital image

Thank You