text detection from image

24
“A Novel Text detection System Based on Character and Link Energies” Presented by: Arun Patel Roll No.: 15EC65R18 M.Tech 1 st year VIPES, IIT Kharagpur 1

Upload: arun-patel

Post on 15-Apr-2017

209 views

Category:

Engineering


0 download

TRANSCRIPT

Page 1: Text Detection From Image

1

“A Novel Text detection System Based on Character and Link Energies”

Presented by: Arun PatelRoll No.: 15EC65R18

M.Tech 1st year VIPES, IIT Kharagpur

Page 2: Text Detection From Image

2

Algorithm• This algorithm can detect most text object in various condition including different lightening,

different colors, complex background and low contrast text.• This method is robust to the font, size, color and orientation of text and discriminate text object

from others effectively.

Fig(1) Algorithm

Page 3: Text Detection From Image

3

Initialization of Candidate Text Objects

Localize the candidate Part Euler number Let and be two candidate parts with widths W and Wvj , heights Hvi and Hvj , and centroids Cvi

and Cvj ;

dist.(Cvi ,Cvj)wd .min(max(Wvi ,Hvi), max(Wvj ,Hvj)) Finally, the candidate character parts that are reachable by one another via one or more links are

grouped to form a candidate text.

Page 4: Text Detection From Image

4Fig.2 Initialization of candidate text objects

Page 5: Text Detection From Image

5

Character Features

• One important characteristic that discriminate text object from other object is that character are made up of strokes that typically have approximately uniform thickness resulting in two near parallel edges sets in their boundaries.

• Two edges sets have high similarities in length, orientation and curvature.• Similarities of two stroke edges is captured by gradient vector of each point on the boundary.

Fig.3 (a) edge pairs of strokes (b)Gradient vectors of ‘R’

Page 6: Text Detection From Image

6

• For a character, it has two near parallel edges sets and the gradients of an edge point and its corresponding point should have approximately opposite direction.

• Distance between the points and their corresponding are similar because the change of stroke width is usually small.

Fig.4 Corresponding pairs and links

Page 7: Text Detection From Image

7

Average Angle Difference of Corres. Pairs(Dangle)

• Let N denote the number of edge points of a candidate part. P(i)(1 ≤ i ≤ N) is the ith edge point with the corresponding point P(i) corr .The difference of the gradient directions of the corresponding pair (P(i) , P(i)

corr.) is defined as:

=abs(-)

• Dangle measures the average gradient direction difference of all corresponding pairs of a candidate part.

• Dangle =

• For an ideal character Dangle reaches the maximum value 1.

Page 8: Text Detection From Image

8

Fraction of non-noise pair (Fnon-

noise) • In some cases, however, a character may have a smaller due to noise or deformations. We

compute Fnon−noise to measure the noise and deformation levels of a part based on d (i)

angle .

• Fnon-noise =

• h(,)=1 if d (i)

angle >β

=0 else

• Fnon-noise is the fraction of all pairs for which the angle difference d (i)

angle is greater than β.

Fig5.Noise connections and non-noise connections Ref.(1)

Page 9: Text Detection From Image

9

R A C E

Dangle 0.889 0.865 0.925 0.897Fnon-noise 0.754 0.684 0.897 0.806

Fig.6 Dangle and Fnon-noise Ref. (1)

Page 10: Text Detection From Image

10

• we divide the non-noise connections into two types: stroke-length connection and stroke-width connection.

• By doing so, we can separate circle like objects and compute the feature vector of stroke width.

• Let k(i)(1 ≤ i ≤ N) be one of N non-noise connections of a part and have Ik(i) intersections with other

non-noise connections. We define stroke-length connection and stroke-width as follows:• K(i)

• For circle, every connection intersects with all other connections at its center. Hence, all non-noise connection of a circle are stroke length connection.

• Character have much more stroke-width connection than the non-characters.

Page 11: Text Detection From Image

11Fig 7. Percentages of stroke-width links of two example images. Ref.(1)

Page 12: Text Detection From Image

12

Vector of Stroke Width ()• The vector of stroke width Vwidth is defined as: =, .• Characters typically have one or two dominating stroke widths depending on their fonts.

• Then, we estimate dominating stroke-width w(i)d through a weighted average computation using w(i)

p and its two immediately adjacent neighbors:

=

Fig.8 Histogram of the lengths of stroke width connections Ref. (1)

Page 13: Text Detection From Image

13

Character Energy• For a part vi , we consider that its and are equally important for text detection and define the

character energy of vi as follows: = ,0 1.

• It can be treated as a measure of the probability that vi is a character.

• Character have larger Echar can discriminate text objects from other objects and it is robust to the font,size,color and orientation of characters.

• and are correlated.

Page 14: Text Detection From Image

14

(a) (b)

Fig.9 two character with different noise/deformation levels Ref. (1)

(a) 0.8846 0.5950 0.5950

(b) 0.8847 0.5261 0.7054

Page 15: Text Detection From Image

15Fig.10 character energy Ref. (1)

Page 16: Text Detection From Image

16

Link energy• Link energy is computed for every candidate link to measure the probability that two parts

connected by the link are both characters.• Link energy is computed by measuring two values: 1. Similarity in the properties of neighboring parts, such as the color, stroke width, and size. 2.Spatial consistency in the direction and distance between neighboring parts in a string of parts.

• For two connected parts vi and vj ,we use color, stroke width(Vwidth),character width, and character height to capture similarities between them.

=.)

• Higher the higher the similarities between two parts.

Page 17: Text Detection From Image

17

Similarity Computation Of two Character

Fig.11 Link energy Ref.(1)

colour =

Vwidth ==

Character width

, =

Character Height

=Simi(), =

Page 18: Text Detection From Image

18

Text Unit Energy • For the text unit containing two parts vi and v j , the text unit energ is computed using character

energies and link energy :• =[()+ ]• To refine the detected text objects, text units whose text unit energies are smaller than a pre-

defined threshold Ttext are removed from the text objects.• choice of this threshold depends upon the characteristic of the datasets, a threshold of of 0.7

worked well for several datasets used for testing this algorithm.

Page 19: Text Detection From Image

19Fig.12 Text energy Ref(1)

Page 20: Text Detection From Image

20Fig.13 Threshold Etext and txt detection outputs Ref(1)

Page 21: Text Detection From Image

21

Result on ICDAR 2003/2005 Dataset Objects

Fig.14 experimental outputs Ref.(2)

Page 22: Text Detection From Image

22

Evalution ResultsAlgorithm Precision

Ashida 0.55

Hinneck Becker 0.62

SWT 0.73

Novel Text detection 0.74

Recall

0.46

0.67

0.60

0.69

Page 23: Text Detection From Image

23

References:• [1] Jing Zhang and Rangachar Kasturi ,“A novel text detection system based on characters

and link energies”, image processing, IEEE trans., vol.23, No.9, pp.4187-4198, September 2014.• [2] S.M.Lucas, A. Panaretos, L.Sosa,A. Tang, S. Wong, and R. Young, “ICDAR 2003 robust reading

competitions”, in Proc. 7th Int. Conf. Document And Recognit.,vol.2,pp. 682,2003.• [3]D,.Marr and Hildreth, “Theory of edge detection,” Proc.Roy.Soc. London B,vol.

207,No.1167,pp. 187-217,1980.

Page 24: Text Detection From Image

24

Thank You