Entropy Analysis
of
Line Scan Signals
for Establishing the
Equivalence of Document
Images
11
Entropy Computation
s
Histogramming
Regression
Line Scan Signals
Equivalence Test
12
Generation of Line Scan Signals
Transforming a text document image’s line scan signals into ENTROPY
Row Transitions
Column Transitions
The structure of the text components are the foreground overlay on the back ground of the document image resulting in a 0/1 transition line scan signals.
‘0 - 1’ Text to Background transition (+ve)
‘1 – 0’ Background to Text transition (-ve)
Entropy – Measure of change in energy caused due to transitions. 13
Entropy is defined as :
E = p Log(1/p) + (1-p) Log(1/(1-p))
‘p’ Probable occurrence of transition.
‘(1-p)’ Non probable occurrence of transition.
Veronique Eglin and Stephane Bres, Document Page Similarity Based on Layout Visual Saliency: Application to Query By Example And Document Classification, International Conference on Document Analysis and Recognition, 2003.
For recognizing the unique structure of the components and to comprehend the content of the components at three hierarchical levels, in this research work, two new models namely,
Conventional Entropy Quantifier (CEQ)
Spatial Entropy Quantifier (SEQ)
have been developed originating with the entropy formulation
14
Conventional Entropy Quantifier (CEQ)
This is based on the number of times the transition occurs in each row and column of the image matrix.
))1/(1log()1()/1log()( pppptE ‘t’ is the transition from 0-1 and 1-0.
‘p’ is calculated based on the number of +ve and -ve transitions in each row or column divided by the total number of columns or rows respectively.
‘1-p’ is the probable non-occurrences of transition.
E(t) is the total entropy.
15
Development of the Computational Model
Depending upon the transition (1/0 or 0/1) as described above E(t) could be E-(t) or E+(t).
Let R represent the set of sequence of ‘m’ rows in a component and C represent the set of sequence of ‘n’ columns in a component.
R= {r / every r is a horizontal row of consecutively placed n pixels one after the other} and 1 m.
C={r / every r is a vertical column of consecutively placed m pixels one below the other} and 1 n.Total entropy along an th row is defined as
E() = E+() + E-()
where E+() = E+(t) at all packets of transitions scanning over ;E-() = E-(t) 1 n.
Hence the net horizontal entropy for a component is defined as EH = R E()
16
Similarly total entropy along a th column is
E() = E+() + E-()
Where E+() = E+(t) at all packets of transitions scanning over ; and E-() = E-(t) 1 m.
Hence the net vertical entropy for a component is EV = C E()
Finally the overall entropy of the component is
EO=EH +EVentropy feature is represented as a 4-dimentional feature set F1, F2, F3 and F4. where,
F1 stands for the total row entropy for transition 1-0 (E-( ))
F2 stands for the total row entropy for transition 0-1(E+( ))
F3 stands for the total column entropy for transition 1-0 (E-())
F4 stands for the total column entropy for transition 0-1(E+())
1-dimensional feature F5 where, F5=F1+F2+F3+F4 is the overall entropy EO
17
Component F1 F2 F3 F4 F5 difference
Hydrobia 5.23 5.23 22.58 22.58 55.63 0.52
Hydrobic 5.22 5.22 22.33 22.33 55.11
Impunity 5.21 5.21 14.58 14.58 39.59 1.11
Impurity 5.22 5.22 14.02 14.02 38.48
Metc 4.77 4.77 46.37 46.37 102.29 2.27
Meta 4.88 4.88 47.97 47.97 105.69
Component F1 F2 F3 F4 F5 difference
Hydrobia 5.08 5.12 26.59 25.60 62.40 1.46
Hydrobic 5.07 5.10 26.00 25.02 61.20
Impunity 5.05 5.08 16.45 16.18 42.76 1.62
Impurity 5.06 5.09 15.78 15.52 41.45
Metc 4.63 4.63 53.38 52.98 115.61 4.52
Meta 4.74 4.73 55.17 54.73 119.38
Distance between the features extracted with tight segmentation
Distance between the features extracted without tight segmentation
18
Experimental Analysis with Highly Resembling Finer Components.
Set Character F1 F2 F3 F4 F5
(0,o)0 5.33 5.33 2.17 2.17 14.99
O 4.28 4.28 3.72 3.72 16.00
(u,v)u 5.08 5.08 4.88 4.88 18.32
v 4.88 5.08 4.88 5.08 18.32
(b,p,d,q)
b 5.02 5.32 3.09 2.64 16.07
p 5.23 4.85 3.09 2.98 16.15
d 5.00 4.28 3.39 3.39 16.05
q 4.67 5.42 3.02 3.33 16.43
(6,9)6 5.14 5.06 2.73 2.90 15.83
9 5.51 5.14 2.79 2.84 16.28
Set of highly resembling finest components
19
Experimental Analysis with Variation in Font Size and Font Style
Font size
F1 F2 F3 F4 F5
8 3.6902 3.8274 7.3345 7.3216 22.1738
10 4.0664 4.337 8.4822 8.4495 25.3352
12 4.7016 4.8436 8.821 8.7091 27.0754
14 5.0523 5.1278 9.1954 9.1356 28.5111
16 5.3422 5.5107 10.1921 10.1921 31.237
18 5.6999 5.7931 10.9236 10.8178 33.2343
Feature vector for a component (word) in different font sizes.
Font style F1 F2 F3 F4 F5
TNR 5.1054 4.9964 6.9746 6.4661 23.5424
Arial 4.9007 5.7858 6.596 5.7773 23.0598
courier 4.0503 4.2283 9.5296 9.3543 27.1625
courierN 3.8731 4.0407 10.5304 9.8292 28.2734
Feature vector for a component (word) in different font styles.
20
Experimental Analysis with Medium (Line) Components
Line is a group of words. Experiments are conducted on lines with different combinations of fine level components.
Line 1: Radha went to school in the afternoon.Line 2: in the afternoon Radha went to school.Line 3: went to school Radha in the afternoon.Line 4: went to school in the afternoon Radha.Line 5: in the afternoon went to school Radha.
sample F1 F2 F3 F4 F5
Line1 5.28 5.28 77.16 77.16 164.86
Line2 5.28 5.28 77.16 77.16 164.86
Line3 5.2753 5.2753 77.1597 77.1597 164.86
Line4 5.2753 5.2753 77.1597 77.1597 164.86
Line5 5.2753 5.2753 77.1597 77.1597 164.86
Feature vector of medium components with tight segmentation
21
Experimental Analysis with Coarse (Paragraph) Components.
paragraph 1 Paragraph 2 Paragraph 3
sample F1 F2 F3 F4 F5
Para1 20.8874 20.9908 38.4377 37.719 118.0349
Para2 36.1343 36.4856 23.7066 23.2941 119.6205
Para3 23.9655 24.0126 33.4495 32.828 114.2556
Feature vector for coarse components with tight segmentation
22
Spatial Entropy Quantifier (SEQ)
The formulation of SEQ essentially emanates from CEQ, which is morphed to take an improved form.
position of transition is represented by pos.
Where pos is the row number in column transition and column number in row transition.
23
Development of the Computational Model
Consider a component of size m x n, obtained from any hierarchical level.
Let R represent the set of sequence of ‘m’ rows and C represents the set of sequence of ‘n’ columns.
R= {r / every r is a horizontal row of consecutively placed n pixels one after the other} and 1 m.
C= {c / every c is a vertical column of consecutively placed m pixels one below the other} and 1 n.
If the transition occurs in a row rα then it contributes to row Entropy at position pos and it is formulated as:
If the transition occurs in a column cβ then it contributes to column entropy at position pos and it is given by
)))))((log())((())log())(((()( posnmnnposmposnnposmrrE
)))))((log())((())log())(((()( posnmmmposnposmmposnccE
24
Total entropy for each row E(r) is the summation of entropy at +ve (E+( r)) and –ve (E-(r)) transitions in a particular row which is formulated as,
nn
rErErE )()()(
Similarly for each column,
mm
cEcEcE )()()(
Then the total horizontal entropy Eh(R) is given by,
nh rERE )()(
Similarly for vertical entropy Ev(C) is given by,
mv cECE )()(
Total entropy of the component is given by
)()()( CEREtE vh 25
Spatial-entropy quantifier ,
Position of occurrence of transition.
Based on the position of the component the entropy also changes.
A Hypothetical Experiment
Component (EMPTY) Position -1
Position -2 Position –3
26
Position-1
Position-3 Position-2
Features Features Features
F1 F2 F3 F4 F1 F2 F3 F4 F1 F2 F3 F4
-34.34 -34.37 -23.17 -23.28 -137.85 -137.97 -11.31 -11.35 -258.00 -258.23 -16.58 -16.63
-34.13 -34.16 -23.04 -23.15 -136.94 -137.04 -11.26 -11.30 -256.38 -256.57 -16.52 -16.57
-38.61 -38.67 -26.10 -26.16 -142.03 -142.28 -14.15 -14.18 -262.08 -262.53 -19.36 -19.39
-38.40 -38.46 -25.98 -26.01 -141.20 -141.44 -14.10 -14.11 -260.65 -261.04 -19.30 -19.31
-42.86 -42.96 -28.96 -29.03 -146.20 -146.60 -16.97 -17.00 -266.16 -266.84 -22.10 -22.14
-42.73 -42.76 -28.90 -28.93 -145.72 -145.84 -16.94 -16.95 -265.32 -265.52 -22.07 -22.09
-42.67 -42.70 -31.82 -31.90 -145.50 -145.61 -19.78 -19.81 -264.92 -265.12 -24.85 -24.89
-47.22 -47.26 -34.72 -34.76 -150.77 -150.91 -22.60 -22.62 -270.90 -271.14 -27.61 -27.63
-47.11 -47.18 -37.57 -37.70 -150.37 -150.63 -25.41 -25.47 -270.23 -270.67 -30.35 -30.42
-47.04 -47.07 -40.60 -40.64 -150.13 -150.25 -28.30 -28.33 -269.81 -270.02 -33.18 -33.21
-51.31 -51.43 -40.41 -40.50 -154.42 -154.80 -28.20 -28.25 -274.09 -274.74 -33.08 -33.13
-55.81 -55.85 -43.55 -43.60 -159.38 -159.53 -31.16 -31.19 -279.50 -279.75 -35.98 -36.00
-55.72 -55.76 -43.25 -43.35 -159.10 -159.24 -31.00 -31.05 -279.04 -279.27 -35.81 -35.86
-55.55 -55.63 -46.40 -46.56 -158.58 -158.84 -33.96 -34.06 -278.16 -278.59 -38.71 -38.80
-55.47 -55.51 -46.09 -46.29 -158.34 -158.46 -33.78 -33.90 -277.74 -277.95 -38.54 -38.65
-60.05 -60.15 -163.54 -163.84 -283.56 -284.05
-59.73 -59.87 -162.62 -163.00 -282.02 -282.65
-64.39 -64.45 -168.00 -168.15 -288.10 -288.35
-64.00 -64.09 -166.90 -167.15 -286.29 -286.72
-68.69 -68.74 -172.31 -172.47 -292.40 -292.66
-68.27 -68.32 -171.17 -171.31 -290.56 -290.78 27
Experimental Analysis with Highly Resembling Components
CEQ F1 F2 F3 F4 F5 Distance(F5)
U 4.8765 4.8765 2.4371 3.69 15.88010
V 4.8765 4.8765 2.4371 3.69 15.8801
SEQ
U 278.18 275.13 66.66 169.14 789.1000.99
V 277.59 275.10 68.54 168.66 789.88
CEQ F1 F2 F3 F4 F5 Distance(F5)
intended 4.69 4.72 21.91 19.2 50.720
indented 4.69 4.72 21.91 19.2 50.72
SEQ
intended 1657.4 1671.3 7707 5546.9 16582.60.2
indented 1656.5 1670.4 7793.4 5462.1 16582.4
Entropy values extracted using CEQ and SEQ
28
Sample TNR cap
O
Arial cap
O
TNR small
O
Arial small
o
TNR digit
0
Arial digit
0
TNR cap - O0 167.4 958.7 645.32 449.2 192
Arial cap - O167.4 0 1126.1 812.72 616.6 359.4
TNR small l – o958.7 1126.1 0 313.38 509.5 768.7
Arial small -o645.32 812.72 313.38 0 196.12 453.32
TNR digit - 0449.2 616.6 509.5 196.12 0 257.2
Arial digit -0192 359.4 768.7 453.32 257.2 0
Distance using feature F5 extracted using SEQ
Experimental Analysis with Variation in Font Size and Font Style.
29
Experimental Analysis with Fine (word) Components
Sample F1 F2 F3 F4 F5Distance using F5
but 680.93 647.51 709.80 472.91 2,511.20
8.24tub 664.00 617.77 661.07 485.95 2,428.80
hydrobia 2970.1 2972.9 17955 18224 42122
2761hydrobic 2899.5 2890.1 16651 16920 39361
impunity 2093.5 2121.1 7263.3 6871.7 18350
1489impurity 1991.1 2018.8 6614.6 6236.2 16861
bombay 1830.9 1819.3 7262.9 7031.6 17945
549dombay 1802 1849.5 6977.7 6767.1 17396
intended 1657.4 1671.3 7707 5546.9 16583
1indented 1656.5 1670.4 7793.4 5462.1 16582
korna 1089.4 1091.6 3411 1958.1 7550
185korne 1026.1 1002 3275.2 2061.8 7365
Distance between words using SEQ
30
Features extracted through CEQ Features extracted through SEQ.
Distance using F5 extracted through CEQ Distance using F5 extracted through SEQ
31
Experimental Analysis with Medium (line) Components
Line 1: Radha went to school in the afternoon.Line 2: in the afternoon Radha went to school.Line 3: went to school Radha in the afternoon.Line 4: went to school in the afternoon Radha.Line 5: in the afternoon went to school Radha.
Feature vectors for 5 line samples using CEQ Feature vector for 5 line samples using SEQ
Distance using F5 extracted through CEQ Distance using F5 extracted through SEQ
32
Wavesim Transform:
A Novel Perspective Of
Wavelet Transform For Signals
The CWT is essentially a collection of inner products of a signal
f(t) and the translated and dilated wavelet for all a and b
where the values W(a,b) represents the correlation coefficients:
CWT has a cross correlation interpretation as well.
−∞
The WaveSim(WS) transform of f(t) with
respect to a wavelet Ψa,b(t) is defined as
where Sim(.,.) is the similarity measure
computed as illustrated in figure with a
Haar Wavelet. The amplitude of Ψ is
max(f(t)).
Wavesim Transform
A1 = Area Covered by the positive half of the Haar WaveletA2 = Area Covered by the negative half of the Haar WaveletA3 = Positive area spanned by f(t) within the positive half of the wavelet.A4 = Negative area spanned by f(t) within the negative half of the waveletA5 = Negative area spanned by f(t) within the postive scale range of the waveletA6 = Positive area spanned by f(t) within the negative scale range of the wavelet
(A3/A1) is the similarity measure between the positive half of the wavelet and f(t)
in the positive scale range, (A5/A1) is the dissimilarity measure between the
positive half of the wavelet and f(t) in the positive scale range.
Likewise (A4/A2) is the similarity measure between the negative half of the
wavelet and f(t) in the negative scale range, (A6/A2) is the dissimilarity measure
between the negative half of the wavelet and f(t) in the negative scale range.
The WaveSim coefficient is computed as follows
Sim(f(t), Ψ(t)) = ½((A3 – A5)/A1 + (A4 – A6)/A2))
½ helps in normalizing the coefficient values to lie in the interval [-
1,1].
Case 1: Input signal is a set of 200 samples with equal values (f(t) is a horizontal straight line)
Case 2: Input signal is a set of 200 samples with the first 100 samples holding a value 20 and the second 100 samples holding a value 80.
Case 3: Input signal is a set of 200 samples linearly increasing from 1 to 200. (f(t) is a straight line)
Case 4: Input signal is a set of 200 samples with a curved structure.
Adaptive WaveSim
Thank You