research article text extraction from historical document

11
Research Article Text Extraction from Historical Document Images by the Combination of Several Thresholding Techniques Toufik Sari, 1 Abderrahmane Kefali, 1,2 and Halima Bahi 1 1 LabGED Laboratory, Badji Mokhtar, BP 12, 23000 Annaba, Algeria 2 08 May 1945 University, BP 401, 24000 Guelma, Algeria Correspondence should be addressed to Abderrahmane Kefali; [email protected] Received 13 June 2014; Revised 4 September 2014; Accepted 10 September 2014; Published 29 September 2014 Academic Editor: Chengcui Zhang Copyright © 2014 Toufik Sari et al. is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. is paper presents a new technique for the binarization of historical document images characterized by deteriorations and damages making their automatic processing difficult at several levels. e proposed method is based on hybrid thresholding combining the advantages of global and local methods and on the mixture of several binarization techniques. Two stages have been included. In the first stage, global thresholding is applied on the entire image and two different thresholds are determined from which the most of image pixels are classified into foreground or background. In the second stage, the remaining pixels are assigned to foreground or background classes based on local analysis. In this stage, several local thresholding methods are combined and the final binary value of each remaining pixel is chosen as the most probable one. e proposed technique has been tested on a large collection of standard and synthetic documents and compared with well-known methods using standard measures and was shown to be more powerful. 1. Introduction Binarization is an important step in the process of document analysis and recognition. It has as goal to segment the image into two classes (foreground and background in the case of document images). e resulting image is a binary image in black and white where the black represents the foreground and the white represents the background. In fact, document image binarization is critical in the sense that bad separation will cause the loss of pertinent information and/or add useless information (noise), generating wrong results. is difficulty increases for old documents which have various types of damages and degradations from the digitization process itself, aging effects, humidity, marks, fungus, dirt, and so forth, making the automatic processing of these materials difficult at several levels. A great number of techniques have been proposed in the literature for the binarization of gray-scale or colored documents images, but no one between them is generic and efficient for all types of documents. e binarization techniques of grayscale images may be classified into two categories: global thresholding and local thresholding [1, 2]. Another category of hybrid methods can be added [3]. e global thresholding methods are widely used in many document image analysis applications for their simplicity and efficiency. However, these methods are powerful only when the original documents are of good quality and well contrasted and have a clear bimodal pattern that separates foreground text and background. For historical document images which are generally noisy and of poor quality, global thresholding methods become not suitable because no single threshold is able to completely separate the foreground from the background of the image since there is no sufficient distinction between the gray range of background and foreground pixels. is kind of document requires a more detailed analysis, which may be guaranteed by local methods. Local methods calculate a different threshold for each pixel based on the information of its neighborhoods. ese methods are more robust against uneven illumination, low contrast, and varying colors than global ones, but they are very time consuming since a separate threshold is computed for each pixel of the image considering its neighborhoods. is calculation becomes slower when increasing the size of neighborhood considered. Hybrid Hindawi Publishing Corporation Advances in Multimedia Volume 2014, Article ID 934656, 10 pages http://dx.doi.org/10.1155/2014/934656

Upload: others

Post on 03-Apr-2022

7 views

Category:

Documents


0 download

TRANSCRIPT

Research ArticleText Extraction from Historical Document Images bythe Combination of Several Thresholding Techniques

Toufik Sari1 Abderrahmane Kefali12 and Halima Bahi1

1 LabGED Laboratory Badji Mokhtar BP 12 23000 Annaba Algeria2 08 May 1945 University BP 401 24000 Guelma Algeria

Correspondence should be addressed to Abderrahmane Kefali kefalilabgednet

Received 13 June 2014 Revised 4 September 2014 Accepted 10 September 2014 Published 29 September 2014

Academic Editor Chengcui Zhang

Copyright copy 2014 Toufik Sari et al This is an open access article distributed under the Creative Commons Attribution Licensewhich permits unrestricted use distribution and reproduction in any medium provided the original work is properly cited

This paper presents a new technique for the binarization of historical document images characterized by deteriorations and damagesmaking their automatic processing difficult at several levels The proposed method is based on hybrid thresholding combining theadvantages of global and local methods and on the mixture of several binarization techniques Two stages have been included Inthe first stage global thresholding is applied on the entire image and two different thresholds are determined from which the mostof image pixels are classified into foreground or background In the second stage the remaining pixels are assigned to foregroundor background classes based on local analysis In this stage several local thresholding methods are combined and the final binaryvalue of each remaining pixel is chosen as the most probable one The proposed technique has been tested on a large collection ofstandard and synthetic documents and compared with well-known methods using standard measures and was shown to be morepowerful

1 Introduction

Binarization is an important step in the process of documentanalysis and recognition It has as goal to segment the imageinto two classes (foreground and background in the case ofdocument images) The resulting image is a binary image inblack and white where the black represents the foregroundand the white represents the background In fact documentimage binarization is critical in the sense that bad separationwill cause the loss of pertinent information andor adduseless information (noise) generating wrong results Thisdifficulty increases for old documents which have varioustypes of damages and degradations from the digitizationprocess itself aging effects humidity marks fungus dirt andso forth making the automatic processing of these materialsdifficult at several levels

A great number of techniques have been proposed inthe literature for the binarization of gray-scale or coloreddocuments images but no one between them is generic andefficient for all types of documents

The binarization techniques of grayscale images may beclassified into two categories global thresholding and local

thresholding [1 2] Another category of hybrid methodscan be added [3] The global thresholding methods arewidely used in many document image analysis applicationsfor their simplicity and efficiency However these methodsare powerful only when the original documents are ofgood quality and well contrasted and have a clear bimodalpattern that separates foreground text and background Forhistorical document images which are generally noisy andof poor quality global thresholding methods become notsuitable because no single threshold is able to completelyseparate the foreground from the background of the imagesince there is no sufficient distinction between the grayrange of background and foreground pixels This kind ofdocument requires a more detailed analysis which may beguaranteed by local methods Local methods calculate adifferent threshold for each pixel based on the informationof its neighborhoodsThese methods are more robust againstuneven illumination low contrast and varying colors thanglobal ones but they are very time consuming since a separatethreshold is computed for each pixel of the image consideringits neighborhoods This calculation becomes slower whenincreasing the size of neighborhood considered Hybrid

Hindawi Publishing CorporationAdvances in MultimediaVolume 2014 Article ID 934656 10 pageshttpdxdoiorg1011552014934656

2 Advances in Multimedia

methods in contrast combine global and local informationfor segmenting the image

In this paper we propose a new hybrid thresholdingtechnique for binarizing images of historical documentsThe proposed approach uses a mixture of thresholdingmethods and it combines the advantages of the two familiesof techniques the computation speed and the efficiencyThe remainder of this paper is organized as follows InSection 2 we present some existing binarization methodsThen in Section 3 we describe the proposed approach Theexperiments performed and the results will be shown inSection 4 before concluding

2 State of the Art

According to [4] existing methodologies for image bina-rization may be divided under two main strategies groupingbased and thresholding based Thresholding based methodsuse global or local threshold(s) to separate the text from thebackground In grouping based methods we distinguish twocategories region based grouping and clustering based group-ing methods Region based grouping methods are mainlybased on spatial-domain region growing or on splitting andmerging Also clustering based grouping methods are basedon classification of intensity or color values as a functionof a homogeneity criterion However several techniqueshave been employed to reach this classification K-meansalgorithm artificial neural networks and so forth

Sezgin and Sankur [5] established a classification ofbinarization methods according to the information that theyexploit in 6 categories

(i) Histogram-based methods the methods of this classperform a thresholding based on the form of thehistogram

(ii) Clustering-based methods these methods assign theimage pixels to one of the two clusters object andbackground

(iii) Entropy-based methods these algorithms use theinformation theory to obtain the threshold

(iv) Object attribute-based methods they find a thresh-old value based on some similarity measurementsbetween original and binary images

(v) Spatial binarization methods they find the optimalthreshold value taking into account spatial measures

(vi) Locally adaptive methods these methods aredesigned to give a new threshold for every pixelSeveral kinds of adaptive methods exist We findmethods based on local gray range local variationand so forth

Wepresent in this section some binarizationmethods themost frequently cited in the literature and we consider onlythresholding based methods

21 Global Methods Note 119868 the grayscale image of whichthe intensities vary from 0 (black) to 1 (white) and 119867 its

histogram of intensities The number of pixels having a graylevel 119894 is denoted as119867(119894)

211 Otsursquos Method Otsursquos method [6] tries to find thethreshold 119879 which separates the gray-level histogram inan optimal way into two segments (which maximize theintersegments variance or whichminimize the intrasegmentsvariance) The calculation of the interclasses or intraclassesvariances is based on the normalized histogram 119867

119899=

[119867119899(0) sdot sdot sdot 119867

119899(255)] of the image where sum119867

119899(119894) = 1

The interclasses variance for each gray level 119905 is given by

119881inter = 1199021(119905) times 119902

2(119905) times [120583

1(119905) minus 120583

2(119905)]2 (1)

such that

1205831(119905) =

1

1199021(119905)

119905minus1

sum

119894=0

119867119899(119894) times 119894

1205832(119905) =

1

1199022(119905)

255

sum

119894=119905

119867119899(119894) times 119894

1199021(119905) =

119905minus1

sum

119894=0

119867119899(119894) 119902

2(119905) =

255

sum

119894=119905

119867119899(119894)

(2)

212 ISODATA Method Thresholding using ISODATA [7]consists in finding a threshold by separating iteratively thegray-level histogram into two classes with the a prioriknowledge of the values associated with each class Thismethod starts by dividing the interval of nonnull values ofthe histogram into two equidistant parts and next we take119898

1

and 1198982as the arithmetic average of each class Repeat until

convergence the calculation of the optimal threshold 119879 as theclosest integer to (119898

1+1198982)2 and update the two averages119898

1

and1198982

213 Kapur et alrsquos Method Kapur et alrsquos method [8] isan entropy based method which takes into account theforeground likelihood distribution 119875

119891and the background

likelihood distribution (119875119887= 1 minus 119875

119891) in the determination of

the division entropy The binarization threshold 119879 is chosenfor which the value 119864 = 119864

119891+ 119864119887is maximal such that

119864119891= minus

119905

sum

119894=0

119875119894

119875119891

times log(119875119894

119875119891

)

119864119887= minus

255

sum

119894=119905+1

119875119894

1 minus 119875119891

times log(119875119894

1 minus 119875119891

)

(3)

where 119875119894is the occurrence probability of the gray level 119894 in the

image and 119875119891= sum119905

119894=0119875119894

214 Iterative Global Thresholding (IGT) The proposedmethod selects a global threshold to the entire image basedon an iterative procedure [9] At each iteration 119894 the followingsteps are performed

(a) calculating the average gray level (119879119894) of the image

(b) subtracting 119879119894from all pixels of the image

Advances in Multimedia 3

(c) histogram equalization to extend the pixels over thewhole gray levels interval

The algorithm stops when |119879119894minus 119879119894minus1| lt 0001

22 LocalMethods Localmethods compute a local thresholdfor each pixel by sliding a square or rectangular window overthe entire image

221 Bernsenrsquos Method It is an adaptive local method [10]Thus for each pixel of coordinates (119909 119910) the threshold isgiven by

119879 (119909 119910) =

119885low + 119885high

2

(4)

such that 119885low and 119885high are the lowest and the highest graylevels respectively in a squared window119908times119908 centered overthe pixel (119909 119910)

However if the local contrast 119862(119909 119910) = (119885high minus 119885low) isbelow a threshold 119897 (119897 = 15) then the neighborhood consistsof a single class foreground or background

222 Niblackrsquos Method The local threshold 119879(119909 119910) is calcu-lated using the mean119898 and standard deviation 120590 of all pixelsin the window (neighborhood of the pixel in question) [11]Thus the threshold 119879(119909 119910) is given by

119879 (119909 119910) = 119898 + 119896 times 120590 (5)

such that 119896 is a parameter used for determining the numberof edge pixels considered as object pixels and takes a negativevalues (119896 is fixed minus02 by authors)

223 Sauvola and Pietikainenrsquos Method Sauvola andPietikainenrsquos algorithm [3] is a modification of that ofNiblack in order to give more performance in the documentswith a background containing a light texture or too variationand uneven illumination In the modification of Sauvola thelocal binarization threshold is given by

119879 (119909 119910) = 119898 times (1 minus 119896 times (1 minus120590

119877)) (6)

where 119877 is the dynamic range of the standard deviation 120590 andthe parameter 119896 takes positives values in the interval [02 05]

224 Nick Method This method improves considerably thebinarization of lighted images and low contrasted images bydownwards moving the threshold of binarization [2] Thethreshold calculation is done as follows

119879 (119909 119910) = 119898 + 119896radic(sum1199012

119894minus 1198982

)

NP(7)

such that 119896 is the Niblack factor and varies between minus01 andminus02 according to the application need119898 is the average graylevel 119901

119894is the gray level of pixel 119894 and NP is the total number

of pixels In their tests the authors used a window of size 19times19

225 Sari et alrsquos Method This method uses an artificialneuron network of multilayer perceptron (MLP) type toclassify the image pixels into two classes foreground andbackground [12] The MLP has one hidden layer 25 inputsand one single output To assign a new value (black orwhite) to a pixel the MLP takes as input a vector of 25values corresponding to the intensities of the pixel in a5 times 5 window centered on the processed pixel The MLPparameters (structure the input statistics etc) have beenchosen after several experiments

23 Hybrid Methods

231 Improved IGTMethod Thismethod [13] is an improve-ment of IGT technique from [9] and it consists of two passesIn the first pass a global thresholding is applied to the entireimage and in the second pass a local thresholding processesareas still containing noise To do this the binary imageresulting from global thresholding is divided into severalsegments of size 119899 times 119899 and for each segment the frequency119891 of black pixels is calculated The segments satisfying thefollowing criteria are kept 119891 gt 120583 + 119896 times 120590 such that 120583 and120590 denote the mean and the standard deviation of the blackpixels frequency in the segment and 119896 is a constant (equal to2 according to the authors) For each detected area the IGTmethod is applied to the corresponding area in the originalimage Areas of size 50times50 give good results according to theauthors

232 Gangamma and Srikantarsquos Method Gangamma andSrikanta [14] proposed amethod based on a simple and effec-tive combination of spatial filters with gray scale morpho-logical operations to remove the background and improvethe quality of historical document image of palm scripts Thefirst step of this technique is to apply adaptive histogramequalization (AHE) to overcome the problem of unevenillumination in the document image On the resulting imageamorphological opening operation is applied and the openedimage is added later with the histogram equalized imageAfter that the morphological closing operation is applied tothe image for smoothing The histogram equalized image issubtracted from the smoothed image and the result is sub-tracted again from the previous addition image A Gaussianfilter is subsequently applied in order to remove the noise Afinal improvement is obtained by adding the last image withthe histogram equalized image Finally a global thresholding(Otsursquos algorithm) is required to separate the text from thebackground

233 Thresholding by Background Subtraction This tech-nique has been proposed in [15] and it consists of three stepsThe background is modeled by removing the handwritingby applying a closing of the original image with a smalldisk as a structuring element After that the background issubtracted from the original image which only leaves theforeground Finally the resulting image is segmented usingOtsursquos algorithm multiplied by an empirical constant

4 Advances in Multimedia

(a) Gray level image

Num

ber o

f pi

xels

Gray levels

133

(b) Its histogram

(c) Thresholding result with Otsursquos method

Figure 1 Global thresholding of a bimodal image

234 Tabatabaie and Bohloolrsquos Method It is a nonparametricmethod proposed for the binarization of bad illuminateddocument images [16] In this method the morphologicalclosing operation is used to solve the background unevenillumination problem Indeed closingmay produce a reason-able estimate of the background if we use the appropriatestructuring element Experiments show that a structuringelement of size equal to twice the stroke size gives the bestresults The appropriate structuring element size is estimatedas follows A global threshold is first applied to the originalimage Then we look for the size of the largest black squarefor each pixel and we save these values in a matrix 119872 Thebiggest value of119872 in each connected set of pixels is calculatedand assigned to the other elements of the set After that the S-histogram (119891) is made starting from the matrix119872 The valueof 119891 at the point 119894 is equal to the number of elements of 119872having the value 119894 Finally we determine 119909max the greatestvalue of 119909 with 119909 satisfying

119909

sum

119894=1

119891 (119894) gt 002 times

infin

sum

119894=1

119891 (119894)

2119909

sum

119894=119909

119891 (119894) = 0 (8)

The structuring element size will be 2119909max

3 Proposed Technique

As we said earlier the global thresholding techniques aregenerally simple and fast which tend to calculate a singlethreshold in order to eliminate all background pixels andpreserve all foreground pixels Unfortunately these tech-niques are only applicablewhen the original documents are ofgood quality well contrasted and with a bimodal histogramFigure 1 shows an example

When the documents are of poor quality containingdifferent types of damages (stains transparency effects etc)with a textured background anduneven illumination orwhenthe gray levels of the foreground pixels and the gray levels ofthe background pixels are close it is not possible to find athreshold that completely separates the foreground from thebackground of the image (Figure 2)

In this case a more detailed analysis is needed and wehave recourse to local methods Local methods are moreaccurate and may be applied to variable backgrounds quitedark or with low contrast but they are very slow since thethreshold calculation based on the local neighborhood infor-mation is done for each pixel of the imageThis computationbecomes slower with larger sliding windows

To solve this problem we propose a hybrid thresholdingapproach that will be fast and at the same time effective aswell as local methods and that is achieved by combiningthe advantages of both families of binarization methods Theproposed technique uses two thresholds 119879

1and 119879

2and it

runs in two passes In the first pass a global thresholding isperformed in order to class the most of pixels of the imageAll pixels having a gray-level higher than 119879

2are removed

(becomes white) because they represent the backgroundpixels All pixels having a gray level lower than 119879

1are

considered as foreground pixels and therefore they are keptand colored in black The remaining pixels are left to thesecond pass in which they are locally binarized by combiningthe results of several local thresholding methods to select themost probable value

We detail in the following the processing steps

31 Estimation of Two Thresholds 1198791and 119879

2 The first step in

the binarization process is the calculation of the two thresh-olds119879

1and1198792 Since the thresholding purpose is to divide the

Advances in Multimedia 5

(a) Original image (b) Otsursquos thresholding result

124

Num

ber o

f pi

xels

Gray levels

(c) Corresponding histogram

Figure 2 Global thresholding of degraded image

image into two classes foreground and background and sincea single threshold is not able to accomplish this task the useof more separation thresholds seems be a perfect solution

These two thresholds are estimated from the gray levelshistogram of the original image and represent the averageintensity of the foreground and background respectively

To obtain these two thresholds we first compute a globalthreshold 119879 using a global thresholding algorithm which canbe Otsursquos algorithm Kapur or any other global algorithmIn our approach we opted for the Otsu algorithm becausethis technique has shown its efficiency and overcame otherglobal methods in several comparative studies [17 18] 119879separates the gray levels histogram of the image into twoclasses foreground and background

1198791and 119879

2are estimated from 119879 Noting 119889min the

minimum distance between the average intensity of theforeground is represented by the first half of the histogramand the average intensity of the background is represented bythe second half

1198791= 119879 minus

119889min2

1198792= 119879 +

119889min2

(9)

32 Global Image Thresholding Using 1198791and 119879

2 After the

estimation of the two thresholds 1198791and 119879

2 all pixels having

a gray level higher than 1198792are transformed into white which

eliminates most of the image background and those whosegray level is less than 119879

1are colored in black Note 119868 the

resulting image These pixels are certainly foreground pixelsThe resulting image still contains some noise but all theforeground information is preserved

33 Local Thresholding of the Remaining Pixels The pixelsunprocessed in the previous step (those with a gray levelbetween 119879

1and 119879

2) may be from the foreground and thus

must be preserved likewise they may be backgroundrsquos ornoisersquos pixels and should be removed The decision to assignthe remaining pixels to one of the two classes foreground orbackground is performed using a local process by examiningthe neighborhood of these pixels To guarantee amore correctclassification we propose to apply several local thresholdingmethods In our experiments we chose the following meth-ods Niblack Sauvola and Nick since these methods wereranked in first places in several previous comparative studies[2 19 20]

For each pixel (119909 119910) of 119868 not yet classified we calculatelocally its new binary values (0 for black and 1 for white)

6 Advances in Multimedia

obtained by applying Niblackrsquos Sauvolarsquos and Nickrsquos methodsand we obtain thus three temporary images 119868

1 1198682 and 119868

3

respectively Each one of the three local methods computesthe binary value of each remaining pixel (119909 119910) by

119868119894(119909 119910) =

0 if 119868 (119909 119910) lt LT119894(119909 119910)

255 otherwise

1 le 119894 le 3

(10)

with LT1 LT2 LT3being the local threshold computed using

Niblackrsquos Sauvolarsquos and Nickrsquos methods respectivelyThe final binary value 119868

119887(119909 119910) of each remaining pixel is

that resulted of at least two of the three methods

119868119887(119909 119910) =

0 if3

sum

119894=1

119868119894(119909 119910) lt 2

1 otherwise(11)

4 Experiments and Results

Experiments have been performed in order to estimate theperformance of our approach We applied the proposedtechnique over a large test set and compared the obtainedresults with well-known methods including global localand hybrid methods The comparison will estimate both thebinarization quality and the execution time

Firstly for parameterized methods a series of experi-ments have been performed in order to set their optimalparameter values Note PS(119898) the parameters set of a specificthresholding method 119898 For example PS(Niblack) = 119896 119908PS(Sauvola) = 119896 119908 119877 and so forth We try to find theoptimal values of PS(119898) giving the binarization results whichare closest to the ground truth images A specific range ofvalues [119886 119887] is first defined for each parameter To improvethe accuracy of the process we used a wide initial rangefor every parameter For Niblackrsquos method per example therange of the parameter 119908 is defined as [119886 119887] = [3 299] Afterthat we apply the binarization method 119898 with the differentvalues of PS(119898) from the predefined ranges on the test setdescribed in Section 41 We compare the binarization resultswith the ground truth images using the evaluation measuresdetailed in Section 42 A ranking of the obtained results isthen performed according to each measure separately Bycalculating the sum of all ranks we can infer the optimalset of parameter values as leading to the top ranks Theoptimal parameter values of the parameterized methods aresummarized in Table 1

41 Test Bases Two test sets have been used for the eval-uation of the proposed method The first is a public setcomposed of document images from the four collections pro-posed within the context of the competitions DIBCO 2009(httpusersiitdemokritosgrsimbgatDIBCO2009bench-mark) H-DIBCO 2010 (httpwwwiitdemokritosgrsimbgatH-DIBCO2010benchmark) DIBCO2011 (httputopiaduthgrsimipratikaDIBCO2011benchmark) and H-DIBCO2012 (httputopiaduthgrsimipratikaHDIBCO2012bench-

Table 1 Optimal parameters values of the parameterized methods

Binarization technique Optimal parameter valuesNiblack [11] 119896 = minus02 and 119908 = 35

Sauvola and Pietikainen [3] 119896 = 02 119877 = 128 and 119908 = 27

Nick 119896 = minus01 and 119908 = 19

Improved IGT 119899 = 50 and 119896 = 3

mark) These four collections contain a total of 50 realdocuments images (37 handwritten and 13 printed) comingfrom the collections of several libraries with the associatedground truth images All the images contain representativedegradations which appear frequently (eg variable back-ground intensity shadows smear smudge low contrastand bleed-through) Figure 3 shows some images from thesecollections

The second set of images is a synthetic collection com-posed of 150 synthetic images of documents constructedby the fusion of 15 different backgrounds and 10 binaryimages (Figure 4) The fusion is done by applying the imagemosaicing by superimposing technique for blending [21]Theidea is as follows we start with some images of documentsin black and white which represent the ground truth andwith some backgrounds extracted from old documents andwe apply a fusion procedure to get as many different imagesof old documents However Stathis et al in [22] proposedtwo different techniques for the blendingmaximum intensityand image averaging We adopt using the image averagingtechnique in order to have a more natural result

42 Evaluation Measures In addition to the execution timethe qualitative assessment is performed in terms of fivestandard evaluation measures used in DIBCO 2009 H-DIBCO 2010 DIBCO 2011 and H-DIBCO 2012 which are 119865-measure PSNR NRM MPM and DRD

Note TP TN FP and FN the true positive true negativefalse positive and false negative values respectively

421 119865-measure 119865-measure was introduced first by Chin-chor in [23]

119865-Measure = 2 times Recall times PrecisionRecall + Precision

(12)

where

Recall = TPTP + FN

Precision = TPTP + FP

(13)

422 PSNR PSNR is a similarity measure between twoimages However the higher the value of PSNR the higherthe similarity of the two images [24 25]

PSNR = 10 sdot log( 1198622

MSE) (14)

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

2 Advances in Multimedia

methods in contrast combine global and local informationfor segmenting the image

In this paper we propose a new hybrid thresholdingtechnique for binarizing images of historical documentsThe proposed approach uses a mixture of thresholdingmethods and it combines the advantages of the two familiesof techniques the computation speed and the efficiencyThe remainder of this paper is organized as follows InSection 2 we present some existing binarization methodsThen in Section 3 we describe the proposed approach Theexperiments performed and the results will be shown inSection 4 before concluding

2 State of the Art

According to [4] existing methodologies for image bina-rization may be divided under two main strategies groupingbased and thresholding based Thresholding based methodsuse global or local threshold(s) to separate the text from thebackground In grouping based methods we distinguish twocategories region based grouping and clustering based group-ing methods Region based grouping methods are mainlybased on spatial-domain region growing or on splitting andmerging Also clustering based grouping methods are basedon classification of intensity or color values as a functionof a homogeneity criterion However several techniqueshave been employed to reach this classification K-meansalgorithm artificial neural networks and so forth

Sezgin and Sankur [5] established a classification ofbinarization methods according to the information that theyexploit in 6 categories

(i) Histogram-based methods the methods of this classperform a thresholding based on the form of thehistogram

(ii) Clustering-based methods these methods assign theimage pixels to one of the two clusters object andbackground

(iii) Entropy-based methods these algorithms use theinformation theory to obtain the threshold

(iv) Object attribute-based methods they find a thresh-old value based on some similarity measurementsbetween original and binary images

(v) Spatial binarization methods they find the optimalthreshold value taking into account spatial measures

(vi) Locally adaptive methods these methods aredesigned to give a new threshold for every pixelSeveral kinds of adaptive methods exist We findmethods based on local gray range local variationand so forth

Wepresent in this section some binarizationmethods themost frequently cited in the literature and we consider onlythresholding based methods

21 Global Methods Note 119868 the grayscale image of whichthe intensities vary from 0 (black) to 1 (white) and 119867 its

histogram of intensities The number of pixels having a graylevel 119894 is denoted as119867(119894)

211 Otsursquos Method Otsursquos method [6] tries to find thethreshold 119879 which separates the gray-level histogram inan optimal way into two segments (which maximize theintersegments variance or whichminimize the intrasegmentsvariance) The calculation of the interclasses or intraclassesvariances is based on the normalized histogram 119867

119899=

[119867119899(0) sdot sdot sdot 119867

119899(255)] of the image where sum119867

119899(119894) = 1

The interclasses variance for each gray level 119905 is given by

119881inter = 1199021(119905) times 119902

2(119905) times [120583

1(119905) minus 120583

2(119905)]2 (1)

such that

1205831(119905) =

1

1199021(119905)

119905minus1

sum

119894=0

119867119899(119894) times 119894

1205832(119905) =

1

1199022(119905)

255

sum

119894=119905

119867119899(119894) times 119894

1199021(119905) =

119905minus1

sum

119894=0

119867119899(119894) 119902

2(119905) =

255

sum

119894=119905

119867119899(119894)

(2)

212 ISODATA Method Thresholding using ISODATA [7]consists in finding a threshold by separating iteratively thegray-level histogram into two classes with the a prioriknowledge of the values associated with each class Thismethod starts by dividing the interval of nonnull values ofthe histogram into two equidistant parts and next we take119898

1

and 1198982as the arithmetic average of each class Repeat until

convergence the calculation of the optimal threshold 119879 as theclosest integer to (119898

1+1198982)2 and update the two averages119898

1

and1198982

213 Kapur et alrsquos Method Kapur et alrsquos method [8] isan entropy based method which takes into account theforeground likelihood distribution 119875

119891and the background

likelihood distribution (119875119887= 1 minus 119875

119891) in the determination of

the division entropy The binarization threshold 119879 is chosenfor which the value 119864 = 119864

119891+ 119864119887is maximal such that

119864119891= minus

119905

sum

119894=0

119875119894

119875119891

times log(119875119894

119875119891

)

119864119887= minus

255

sum

119894=119905+1

119875119894

1 minus 119875119891

times log(119875119894

1 minus 119875119891

)

(3)

where 119875119894is the occurrence probability of the gray level 119894 in the

image and 119875119891= sum119905

119894=0119875119894

214 Iterative Global Thresholding (IGT) The proposedmethod selects a global threshold to the entire image basedon an iterative procedure [9] At each iteration 119894 the followingsteps are performed

(a) calculating the average gray level (119879119894) of the image

(b) subtracting 119879119894from all pixels of the image

Advances in Multimedia 3

(c) histogram equalization to extend the pixels over thewhole gray levels interval

The algorithm stops when |119879119894minus 119879119894minus1| lt 0001

22 LocalMethods Localmethods compute a local thresholdfor each pixel by sliding a square or rectangular window overthe entire image

221 Bernsenrsquos Method It is an adaptive local method [10]Thus for each pixel of coordinates (119909 119910) the threshold isgiven by

119879 (119909 119910) =

119885low + 119885high

2

(4)

such that 119885low and 119885high are the lowest and the highest graylevels respectively in a squared window119908times119908 centered overthe pixel (119909 119910)

However if the local contrast 119862(119909 119910) = (119885high minus 119885low) isbelow a threshold 119897 (119897 = 15) then the neighborhood consistsof a single class foreground or background

222 Niblackrsquos Method The local threshold 119879(119909 119910) is calcu-lated using the mean119898 and standard deviation 120590 of all pixelsin the window (neighborhood of the pixel in question) [11]Thus the threshold 119879(119909 119910) is given by

119879 (119909 119910) = 119898 + 119896 times 120590 (5)

such that 119896 is a parameter used for determining the numberof edge pixels considered as object pixels and takes a negativevalues (119896 is fixed minus02 by authors)

223 Sauvola and Pietikainenrsquos Method Sauvola andPietikainenrsquos algorithm [3] is a modification of that ofNiblack in order to give more performance in the documentswith a background containing a light texture or too variationand uneven illumination In the modification of Sauvola thelocal binarization threshold is given by

119879 (119909 119910) = 119898 times (1 minus 119896 times (1 minus120590

119877)) (6)

where 119877 is the dynamic range of the standard deviation 120590 andthe parameter 119896 takes positives values in the interval [02 05]

224 Nick Method This method improves considerably thebinarization of lighted images and low contrasted images bydownwards moving the threshold of binarization [2] Thethreshold calculation is done as follows

119879 (119909 119910) = 119898 + 119896radic(sum1199012

119894minus 1198982

)

NP(7)

such that 119896 is the Niblack factor and varies between minus01 andminus02 according to the application need119898 is the average graylevel 119901

119894is the gray level of pixel 119894 and NP is the total number

of pixels In their tests the authors used a window of size 19times19

225 Sari et alrsquos Method This method uses an artificialneuron network of multilayer perceptron (MLP) type toclassify the image pixels into two classes foreground andbackground [12] The MLP has one hidden layer 25 inputsand one single output To assign a new value (black orwhite) to a pixel the MLP takes as input a vector of 25values corresponding to the intensities of the pixel in a5 times 5 window centered on the processed pixel The MLPparameters (structure the input statistics etc) have beenchosen after several experiments

23 Hybrid Methods

231 Improved IGTMethod Thismethod [13] is an improve-ment of IGT technique from [9] and it consists of two passesIn the first pass a global thresholding is applied to the entireimage and in the second pass a local thresholding processesareas still containing noise To do this the binary imageresulting from global thresholding is divided into severalsegments of size 119899 times 119899 and for each segment the frequency119891 of black pixels is calculated The segments satisfying thefollowing criteria are kept 119891 gt 120583 + 119896 times 120590 such that 120583 and120590 denote the mean and the standard deviation of the blackpixels frequency in the segment and 119896 is a constant (equal to2 according to the authors) For each detected area the IGTmethod is applied to the corresponding area in the originalimage Areas of size 50times50 give good results according to theauthors

232 Gangamma and Srikantarsquos Method Gangamma andSrikanta [14] proposed amethod based on a simple and effec-tive combination of spatial filters with gray scale morpho-logical operations to remove the background and improvethe quality of historical document image of palm scripts Thefirst step of this technique is to apply adaptive histogramequalization (AHE) to overcome the problem of unevenillumination in the document image On the resulting imageamorphological opening operation is applied and the openedimage is added later with the histogram equalized imageAfter that the morphological closing operation is applied tothe image for smoothing The histogram equalized image issubtracted from the smoothed image and the result is sub-tracted again from the previous addition image A Gaussianfilter is subsequently applied in order to remove the noise Afinal improvement is obtained by adding the last image withthe histogram equalized image Finally a global thresholding(Otsursquos algorithm) is required to separate the text from thebackground

233 Thresholding by Background Subtraction This tech-nique has been proposed in [15] and it consists of three stepsThe background is modeled by removing the handwritingby applying a closing of the original image with a smalldisk as a structuring element After that the background issubtracted from the original image which only leaves theforeground Finally the resulting image is segmented usingOtsursquos algorithm multiplied by an empirical constant

4 Advances in Multimedia

(a) Gray level image

Num

ber o

f pi

xels

Gray levels

133

(b) Its histogram

(c) Thresholding result with Otsursquos method

Figure 1 Global thresholding of a bimodal image

234 Tabatabaie and Bohloolrsquos Method It is a nonparametricmethod proposed for the binarization of bad illuminateddocument images [16] In this method the morphologicalclosing operation is used to solve the background unevenillumination problem Indeed closingmay produce a reason-able estimate of the background if we use the appropriatestructuring element Experiments show that a structuringelement of size equal to twice the stroke size gives the bestresults The appropriate structuring element size is estimatedas follows A global threshold is first applied to the originalimage Then we look for the size of the largest black squarefor each pixel and we save these values in a matrix 119872 Thebiggest value of119872 in each connected set of pixels is calculatedand assigned to the other elements of the set After that the S-histogram (119891) is made starting from the matrix119872 The valueof 119891 at the point 119894 is equal to the number of elements of 119872having the value 119894 Finally we determine 119909max the greatestvalue of 119909 with 119909 satisfying

119909

sum

119894=1

119891 (119894) gt 002 times

infin

sum

119894=1

119891 (119894)

2119909

sum

119894=119909

119891 (119894) = 0 (8)

The structuring element size will be 2119909max

3 Proposed Technique

As we said earlier the global thresholding techniques aregenerally simple and fast which tend to calculate a singlethreshold in order to eliminate all background pixels andpreserve all foreground pixels Unfortunately these tech-niques are only applicablewhen the original documents are ofgood quality well contrasted and with a bimodal histogramFigure 1 shows an example

When the documents are of poor quality containingdifferent types of damages (stains transparency effects etc)with a textured background anduneven illumination orwhenthe gray levels of the foreground pixels and the gray levels ofthe background pixels are close it is not possible to find athreshold that completely separates the foreground from thebackground of the image (Figure 2)

In this case a more detailed analysis is needed and wehave recourse to local methods Local methods are moreaccurate and may be applied to variable backgrounds quitedark or with low contrast but they are very slow since thethreshold calculation based on the local neighborhood infor-mation is done for each pixel of the imageThis computationbecomes slower with larger sliding windows

To solve this problem we propose a hybrid thresholdingapproach that will be fast and at the same time effective aswell as local methods and that is achieved by combiningthe advantages of both families of binarization methods Theproposed technique uses two thresholds 119879

1and 119879

2and it

runs in two passes In the first pass a global thresholding isperformed in order to class the most of pixels of the imageAll pixels having a gray-level higher than 119879

2are removed

(becomes white) because they represent the backgroundpixels All pixels having a gray level lower than 119879

1are

considered as foreground pixels and therefore they are keptand colored in black The remaining pixels are left to thesecond pass in which they are locally binarized by combiningthe results of several local thresholding methods to select themost probable value

We detail in the following the processing steps

31 Estimation of Two Thresholds 1198791and 119879

2 The first step in

the binarization process is the calculation of the two thresh-olds119879

1and1198792 Since the thresholding purpose is to divide the

Advances in Multimedia 5

(a) Original image (b) Otsursquos thresholding result

124

Num

ber o

f pi

xels

Gray levels

(c) Corresponding histogram

Figure 2 Global thresholding of degraded image

image into two classes foreground and background and sincea single threshold is not able to accomplish this task the useof more separation thresholds seems be a perfect solution

These two thresholds are estimated from the gray levelshistogram of the original image and represent the averageintensity of the foreground and background respectively

To obtain these two thresholds we first compute a globalthreshold 119879 using a global thresholding algorithm which canbe Otsursquos algorithm Kapur or any other global algorithmIn our approach we opted for the Otsu algorithm becausethis technique has shown its efficiency and overcame otherglobal methods in several comparative studies [17 18] 119879separates the gray levels histogram of the image into twoclasses foreground and background

1198791and 119879

2are estimated from 119879 Noting 119889min the

minimum distance between the average intensity of theforeground is represented by the first half of the histogramand the average intensity of the background is represented bythe second half

1198791= 119879 minus

119889min2

1198792= 119879 +

119889min2

(9)

32 Global Image Thresholding Using 1198791and 119879

2 After the

estimation of the two thresholds 1198791and 119879

2 all pixels having

a gray level higher than 1198792are transformed into white which

eliminates most of the image background and those whosegray level is less than 119879

1are colored in black Note 119868 the

resulting image These pixels are certainly foreground pixelsThe resulting image still contains some noise but all theforeground information is preserved

33 Local Thresholding of the Remaining Pixels The pixelsunprocessed in the previous step (those with a gray levelbetween 119879

1and 119879

2) may be from the foreground and thus

must be preserved likewise they may be backgroundrsquos ornoisersquos pixels and should be removed The decision to assignthe remaining pixels to one of the two classes foreground orbackground is performed using a local process by examiningthe neighborhood of these pixels To guarantee amore correctclassification we propose to apply several local thresholdingmethods In our experiments we chose the following meth-ods Niblack Sauvola and Nick since these methods wereranked in first places in several previous comparative studies[2 19 20]

For each pixel (119909 119910) of 119868 not yet classified we calculatelocally its new binary values (0 for black and 1 for white)

6 Advances in Multimedia

obtained by applying Niblackrsquos Sauvolarsquos and Nickrsquos methodsand we obtain thus three temporary images 119868

1 1198682 and 119868

3

respectively Each one of the three local methods computesthe binary value of each remaining pixel (119909 119910) by

119868119894(119909 119910) =

0 if 119868 (119909 119910) lt LT119894(119909 119910)

255 otherwise

1 le 119894 le 3

(10)

with LT1 LT2 LT3being the local threshold computed using

Niblackrsquos Sauvolarsquos and Nickrsquos methods respectivelyThe final binary value 119868

119887(119909 119910) of each remaining pixel is

that resulted of at least two of the three methods

119868119887(119909 119910) =

0 if3

sum

119894=1

119868119894(119909 119910) lt 2

1 otherwise(11)

4 Experiments and Results

Experiments have been performed in order to estimate theperformance of our approach We applied the proposedtechnique over a large test set and compared the obtainedresults with well-known methods including global localand hybrid methods The comparison will estimate both thebinarization quality and the execution time

Firstly for parameterized methods a series of experi-ments have been performed in order to set their optimalparameter values Note PS(119898) the parameters set of a specificthresholding method 119898 For example PS(Niblack) = 119896 119908PS(Sauvola) = 119896 119908 119877 and so forth We try to find theoptimal values of PS(119898) giving the binarization results whichare closest to the ground truth images A specific range ofvalues [119886 119887] is first defined for each parameter To improvethe accuracy of the process we used a wide initial rangefor every parameter For Niblackrsquos method per example therange of the parameter 119908 is defined as [119886 119887] = [3 299] Afterthat we apply the binarization method 119898 with the differentvalues of PS(119898) from the predefined ranges on the test setdescribed in Section 41 We compare the binarization resultswith the ground truth images using the evaluation measuresdetailed in Section 42 A ranking of the obtained results isthen performed according to each measure separately Bycalculating the sum of all ranks we can infer the optimalset of parameter values as leading to the top ranks Theoptimal parameter values of the parameterized methods aresummarized in Table 1

41 Test Bases Two test sets have been used for the eval-uation of the proposed method The first is a public setcomposed of document images from the four collections pro-posed within the context of the competitions DIBCO 2009(httpusersiitdemokritosgrsimbgatDIBCO2009bench-mark) H-DIBCO 2010 (httpwwwiitdemokritosgrsimbgatH-DIBCO2010benchmark) DIBCO2011 (httputopiaduthgrsimipratikaDIBCO2011benchmark) and H-DIBCO2012 (httputopiaduthgrsimipratikaHDIBCO2012bench-

Table 1 Optimal parameters values of the parameterized methods

Binarization technique Optimal parameter valuesNiblack [11] 119896 = minus02 and 119908 = 35

Sauvola and Pietikainen [3] 119896 = 02 119877 = 128 and 119908 = 27

Nick 119896 = minus01 and 119908 = 19

Improved IGT 119899 = 50 and 119896 = 3

mark) These four collections contain a total of 50 realdocuments images (37 handwritten and 13 printed) comingfrom the collections of several libraries with the associatedground truth images All the images contain representativedegradations which appear frequently (eg variable back-ground intensity shadows smear smudge low contrastand bleed-through) Figure 3 shows some images from thesecollections

The second set of images is a synthetic collection com-posed of 150 synthetic images of documents constructedby the fusion of 15 different backgrounds and 10 binaryimages (Figure 4) The fusion is done by applying the imagemosaicing by superimposing technique for blending [21]Theidea is as follows we start with some images of documentsin black and white which represent the ground truth andwith some backgrounds extracted from old documents andwe apply a fusion procedure to get as many different imagesof old documents However Stathis et al in [22] proposedtwo different techniques for the blendingmaximum intensityand image averaging We adopt using the image averagingtechnique in order to have a more natural result

42 Evaluation Measures In addition to the execution timethe qualitative assessment is performed in terms of fivestandard evaluation measures used in DIBCO 2009 H-DIBCO 2010 DIBCO 2011 and H-DIBCO 2012 which are 119865-measure PSNR NRM MPM and DRD

Note TP TN FP and FN the true positive true negativefalse positive and false negative values respectively

421 119865-measure 119865-measure was introduced first by Chin-chor in [23]

119865-Measure = 2 times Recall times PrecisionRecall + Precision

(12)

where

Recall = TPTP + FN

Precision = TPTP + FP

(13)

422 PSNR PSNR is a similarity measure between twoimages However the higher the value of PSNR the higherthe similarity of the two images [24 25]

PSNR = 10 sdot log( 1198622

MSE) (14)

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Advances in Multimedia 3

(c) histogram equalization to extend the pixels over thewhole gray levels interval

The algorithm stops when |119879119894minus 119879119894minus1| lt 0001

22 LocalMethods Localmethods compute a local thresholdfor each pixel by sliding a square or rectangular window overthe entire image

221 Bernsenrsquos Method It is an adaptive local method [10]Thus for each pixel of coordinates (119909 119910) the threshold isgiven by

119879 (119909 119910) =

119885low + 119885high

2

(4)

such that 119885low and 119885high are the lowest and the highest graylevels respectively in a squared window119908times119908 centered overthe pixel (119909 119910)

However if the local contrast 119862(119909 119910) = (119885high minus 119885low) isbelow a threshold 119897 (119897 = 15) then the neighborhood consistsof a single class foreground or background

222 Niblackrsquos Method The local threshold 119879(119909 119910) is calcu-lated using the mean119898 and standard deviation 120590 of all pixelsin the window (neighborhood of the pixel in question) [11]Thus the threshold 119879(119909 119910) is given by

119879 (119909 119910) = 119898 + 119896 times 120590 (5)

such that 119896 is a parameter used for determining the numberof edge pixels considered as object pixels and takes a negativevalues (119896 is fixed minus02 by authors)

223 Sauvola and Pietikainenrsquos Method Sauvola andPietikainenrsquos algorithm [3] is a modification of that ofNiblack in order to give more performance in the documentswith a background containing a light texture or too variationand uneven illumination In the modification of Sauvola thelocal binarization threshold is given by

119879 (119909 119910) = 119898 times (1 minus 119896 times (1 minus120590

119877)) (6)

where 119877 is the dynamic range of the standard deviation 120590 andthe parameter 119896 takes positives values in the interval [02 05]

224 Nick Method This method improves considerably thebinarization of lighted images and low contrasted images bydownwards moving the threshold of binarization [2] Thethreshold calculation is done as follows

119879 (119909 119910) = 119898 + 119896radic(sum1199012

119894minus 1198982

)

NP(7)

such that 119896 is the Niblack factor and varies between minus01 andminus02 according to the application need119898 is the average graylevel 119901

119894is the gray level of pixel 119894 and NP is the total number

of pixels In their tests the authors used a window of size 19times19

225 Sari et alrsquos Method This method uses an artificialneuron network of multilayer perceptron (MLP) type toclassify the image pixels into two classes foreground andbackground [12] The MLP has one hidden layer 25 inputsand one single output To assign a new value (black orwhite) to a pixel the MLP takes as input a vector of 25values corresponding to the intensities of the pixel in a5 times 5 window centered on the processed pixel The MLPparameters (structure the input statistics etc) have beenchosen after several experiments

23 Hybrid Methods

231 Improved IGTMethod Thismethod [13] is an improve-ment of IGT technique from [9] and it consists of two passesIn the first pass a global thresholding is applied to the entireimage and in the second pass a local thresholding processesareas still containing noise To do this the binary imageresulting from global thresholding is divided into severalsegments of size 119899 times 119899 and for each segment the frequency119891 of black pixels is calculated The segments satisfying thefollowing criteria are kept 119891 gt 120583 + 119896 times 120590 such that 120583 and120590 denote the mean and the standard deviation of the blackpixels frequency in the segment and 119896 is a constant (equal to2 according to the authors) For each detected area the IGTmethod is applied to the corresponding area in the originalimage Areas of size 50times50 give good results according to theauthors

232 Gangamma and Srikantarsquos Method Gangamma andSrikanta [14] proposed amethod based on a simple and effec-tive combination of spatial filters with gray scale morpho-logical operations to remove the background and improvethe quality of historical document image of palm scripts Thefirst step of this technique is to apply adaptive histogramequalization (AHE) to overcome the problem of unevenillumination in the document image On the resulting imageamorphological opening operation is applied and the openedimage is added later with the histogram equalized imageAfter that the morphological closing operation is applied tothe image for smoothing The histogram equalized image issubtracted from the smoothed image and the result is sub-tracted again from the previous addition image A Gaussianfilter is subsequently applied in order to remove the noise Afinal improvement is obtained by adding the last image withthe histogram equalized image Finally a global thresholding(Otsursquos algorithm) is required to separate the text from thebackground

233 Thresholding by Background Subtraction This tech-nique has been proposed in [15] and it consists of three stepsThe background is modeled by removing the handwritingby applying a closing of the original image with a smalldisk as a structuring element After that the background issubtracted from the original image which only leaves theforeground Finally the resulting image is segmented usingOtsursquos algorithm multiplied by an empirical constant

4 Advances in Multimedia

(a) Gray level image

Num

ber o

f pi

xels

Gray levels

133

(b) Its histogram

(c) Thresholding result with Otsursquos method

Figure 1 Global thresholding of a bimodal image

234 Tabatabaie and Bohloolrsquos Method It is a nonparametricmethod proposed for the binarization of bad illuminateddocument images [16] In this method the morphologicalclosing operation is used to solve the background unevenillumination problem Indeed closingmay produce a reason-able estimate of the background if we use the appropriatestructuring element Experiments show that a structuringelement of size equal to twice the stroke size gives the bestresults The appropriate structuring element size is estimatedas follows A global threshold is first applied to the originalimage Then we look for the size of the largest black squarefor each pixel and we save these values in a matrix 119872 Thebiggest value of119872 in each connected set of pixels is calculatedand assigned to the other elements of the set After that the S-histogram (119891) is made starting from the matrix119872 The valueof 119891 at the point 119894 is equal to the number of elements of 119872having the value 119894 Finally we determine 119909max the greatestvalue of 119909 with 119909 satisfying

119909

sum

119894=1

119891 (119894) gt 002 times

infin

sum

119894=1

119891 (119894)

2119909

sum

119894=119909

119891 (119894) = 0 (8)

The structuring element size will be 2119909max

3 Proposed Technique

As we said earlier the global thresholding techniques aregenerally simple and fast which tend to calculate a singlethreshold in order to eliminate all background pixels andpreserve all foreground pixels Unfortunately these tech-niques are only applicablewhen the original documents are ofgood quality well contrasted and with a bimodal histogramFigure 1 shows an example

When the documents are of poor quality containingdifferent types of damages (stains transparency effects etc)with a textured background anduneven illumination orwhenthe gray levels of the foreground pixels and the gray levels ofthe background pixels are close it is not possible to find athreshold that completely separates the foreground from thebackground of the image (Figure 2)

In this case a more detailed analysis is needed and wehave recourse to local methods Local methods are moreaccurate and may be applied to variable backgrounds quitedark or with low contrast but they are very slow since thethreshold calculation based on the local neighborhood infor-mation is done for each pixel of the imageThis computationbecomes slower with larger sliding windows

To solve this problem we propose a hybrid thresholdingapproach that will be fast and at the same time effective aswell as local methods and that is achieved by combiningthe advantages of both families of binarization methods Theproposed technique uses two thresholds 119879

1and 119879

2and it

runs in two passes In the first pass a global thresholding isperformed in order to class the most of pixels of the imageAll pixels having a gray-level higher than 119879

2are removed

(becomes white) because they represent the backgroundpixels All pixels having a gray level lower than 119879

1are

considered as foreground pixels and therefore they are keptand colored in black The remaining pixels are left to thesecond pass in which they are locally binarized by combiningthe results of several local thresholding methods to select themost probable value

We detail in the following the processing steps

31 Estimation of Two Thresholds 1198791and 119879

2 The first step in

the binarization process is the calculation of the two thresh-olds119879

1and1198792 Since the thresholding purpose is to divide the

Advances in Multimedia 5

(a) Original image (b) Otsursquos thresholding result

124

Num

ber o

f pi

xels

Gray levels

(c) Corresponding histogram

Figure 2 Global thresholding of degraded image

image into two classes foreground and background and sincea single threshold is not able to accomplish this task the useof more separation thresholds seems be a perfect solution

These two thresholds are estimated from the gray levelshistogram of the original image and represent the averageintensity of the foreground and background respectively

To obtain these two thresholds we first compute a globalthreshold 119879 using a global thresholding algorithm which canbe Otsursquos algorithm Kapur or any other global algorithmIn our approach we opted for the Otsu algorithm becausethis technique has shown its efficiency and overcame otherglobal methods in several comparative studies [17 18] 119879separates the gray levels histogram of the image into twoclasses foreground and background

1198791and 119879

2are estimated from 119879 Noting 119889min the

minimum distance between the average intensity of theforeground is represented by the first half of the histogramand the average intensity of the background is represented bythe second half

1198791= 119879 minus

119889min2

1198792= 119879 +

119889min2

(9)

32 Global Image Thresholding Using 1198791and 119879

2 After the

estimation of the two thresholds 1198791and 119879

2 all pixels having

a gray level higher than 1198792are transformed into white which

eliminates most of the image background and those whosegray level is less than 119879

1are colored in black Note 119868 the

resulting image These pixels are certainly foreground pixelsThe resulting image still contains some noise but all theforeground information is preserved

33 Local Thresholding of the Remaining Pixels The pixelsunprocessed in the previous step (those with a gray levelbetween 119879

1and 119879

2) may be from the foreground and thus

must be preserved likewise they may be backgroundrsquos ornoisersquos pixels and should be removed The decision to assignthe remaining pixels to one of the two classes foreground orbackground is performed using a local process by examiningthe neighborhood of these pixels To guarantee amore correctclassification we propose to apply several local thresholdingmethods In our experiments we chose the following meth-ods Niblack Sauvola and Nick since these methods wereranked in first places in several previous comparative studies[2 19 20]

For each pixel (119909 119910) of 119868 not yet classified we calculatelocally its new binary values (0 for black and 1 for white)

6 Advances in Multimedia

obtained by applying Niblackrsquos Sauvolarsquos and Nickrsquos methodsand we obtain thus three temporary images 119868

1 1198682 and 119868

3

respectively Each one of the three local methods computesthe binary value of each remaining pixel (119909 119910) by

119868119894(119909 119910) =

0 if 119868 (119909 119910) lt LT119894(119909 119910)

255 otherwise

1 le 119894 le 3

(10)

with LT1 LT2 LT3being the local threshold computed using

Niblackrsquos Sauvolarsquos and Nickrsquos methods respectivelyThe final binary value 119868

119887(119909 119910) of each remaining pixel is

that resulted of at least two of the three methods

119868119887(119909 119910) =

0 if3

sum

119894=1

119868119894(119909 119910) lt 2

1 otherwise(11)

4 Experiments and Results

Experiments have been performed in order to estimate theperformance of our approach We applied the proposedtechnique over a large test set and compared the obtainedresults with well-known methods including global localand hybrid methods The comparison will estimate both thebinarization quality and the execution time

Firstly for parameterized methods a series of experi-ments have been performed in order to set their optimalparameter values Note PS(119898) the parameters set of a specificthresholding method 119898 For example PS(Niblack) = 119896 119908PS(Sauvola) = 119896 119908 119877 and so forth We try to find theoptimal values of PS(119898) giving the binarization results whichare closest to the ground truth images A specific range ofvalues [119886 119887] is first defined for each parameter To improvethe accuracy of the process we used a wide initial rangefor every parameter For Niblackrsquos method per example therange of the parameter 119908 is defined as [119886 119887] = [3 299] Afterthat we apply the binarization method 119898 with the differentvalues of PS(119898) from the predefined ranges on the test setdescribed in Section 41 We compare the binarization resultswith the ground truth images using the evaluation measuresdetailed in Section 42 A ranking of the obtained results isthen performed according to each measure separately Bycalculating the sum of all ranks we can infer the optimalset of parameter values as leading to the top ranks Theoptimal parameter values of the parameterized methods aresummarized in Table 1

41 Test Bases Two test sets have been used for the eval-uation of the proposed method The first is a public setcomposed of document images from the four collections pro-posed within the context of the competitions DIBCO 2009(httpusersiitdemokritosgrsimbgatDIBCO2009bench-mark) H-DIBCO 2010 (httpwwwiitdemokritosgrsimbgatH-DIBCO2010benchmark) DIBCO2011 (httputopiaduthgrsimipratikaDIBCO2011benchmark) and H-DIBCO2012 (httputopiaduthgrsimipratikaHDIBCO2012bench-

Table 1 Optimal parameters values of the parameterized methods

Binarization technique Optimal parameter valuesNiblack [11] 119896 = minus02 and 119908 = 35

Sauvola and Pietikainen [3] 119896 = 02 119877 = 128 and 119908 = 27

Nick 119896 = minus01 and 119908 = 19

Improved IGT 119899 = 50 and 119896 = 3

mark) These four collections contain a total of 50 realdocuments images (37 handwritten and 13 printed) comingfrom the collections of several libraries with the associatedground truth images All the images contain representativedegradations which appear frequently (eg variable back-ground intensity shadows smear smudge low contrastand bleed-through) Figure 3 shows some images from thesecollections

The second set of images is a synthetic collection com-posed of 150 synthetic images of documents constructedby the fusion of 15 different backgrounds and 10 binaryimages (Figure 4) The fusion is done by applying the imagemosaicing by superimposing technique for blending [21]Theidea is as follows we start with some images of documentsin black and white which represent the ground truth andwith some backgrounds extracted from old documents andwe apply a fusion procedure to get as many different imagesof old documents However Stathis et al in [22] proposedtwo different techniques for the blendingmaximum intensityand image averaging We adopt using the image averagingtechnique in order to have a more natural result

42 Evaluation Measures In addition to the execution timethe qualitative assessment is performed in terms of fivestandard evaluation measures used in DIBCO 2009 H-DIBCO 2010 DIBCO 2011 and H-DIBCO 2012 which are 119865-measure PSNR NRM MPM and DRD

Note TP TN FP and FN the true positive true negativefalse positive and false negative values respectively

421 119865-measure 119865-measure was introduced first by Chin-chor in [23]

119865-Measure = 2 times Recall times PrecisionRecall + Precision

(12)

where

Recall = TPTP + FN

Precision = TPTP + FP

(13)

422 PSNR PSNR is a similarity measure between twoimages However the higher the value of PSNR the higherthe similarity of the two images [24 25]

PSNR = 10 sdot log( 1198622

MSE) (14)

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

4 Advances in Multimedia

(a) Gray level image

Num

ber o

f pi

xels

Gray levels

133

(b) Its histogram

(c) Thresholding result with Otsursquos method

Figure 1 Global thresholding of a bimodal image

234 Tabatabaie and Bohloolrsquos Method It is a nonparametricmethod proposed for the binarization of bad illuminateddocument images [16] In this method the morphologicalclosing operation is used to solve the background unevenillumination problem Indeed closingmay produce a reason-able estimate of the background if we use the appropriatestructuring element Experiments show that a structuringelement of size equal to twice the stroke size gives the bestresults The appropriate structuring element size is estimatedas follows A global threshold is first applied to the originalimage Then we look for the size of the largest black squarefor each pixel and we save these values in a matrix 119872 Thebiggest value of119872 in each connected set of pixels is calculatedand assigned to the other elements of the set After that the S-histogram (119891) is made starting from the matrix119872 The valueof 119891 at the point 119894 is equal to the number of elements of 119872having the value 119894 Finally we determine 119909max the greatestvalue of 119909 with 119909 satisfying

119909

sum

119894=1

119891 (119894) gt 002 times

infin

sum

119894=1

119891 (119894)

2119909

sum

119894=119909

119891 (119894) = 0 (8)

The structuring element size will be 2119909max

3 Proposed Technique

As we said earlier the global thresholding techniques aregenerally simple and fast which tend to calculate a singlethreshold in order to eliminate all background pixels andpreserve all foreground pixels Unfortunately these tech-niques are only applicablewhen the original documents are ofgood quality well contrasted and with a bimodal histogramFigure 1 shows an example

When the documents are of poor quality containingdifferent types of damages (stains transparency effects etc)with a textured background anduneven illumination orwhenthe gray levels of the foreground pixels and the gray levels ofthe background pixels are close it is not possible to find athreshold that completely separates the foreground from thebackground of the image (Figure 2)

In this case a more detailed analysis is needed and wehave recourse to local methods Local methods are moreaccurate and may be applied to variable backgrounds quitedark or with low contrast but they are very slow since thethreshold calculation based on the local neighborhood infor-mation is done for each pixel of the imageThis computationbecomes slower with larger sliding windows

To solve this problem we propose a hybrid thresholdingapproach that will be fast and at the same time effective aswell as local methods and that is achieved by combiningthe advantages of both families of binarization methods Theproposed technique uses two thresholds 119879

1and 119879

2and it

runs in two passes In the first pass a global thresholding isperformed in order to class the most of pixels of the imageAll pixels having a gray-level higher than 119879

2are removed

(becomes white) because they represent the backgroundpixels All pixels having a gray level lower than 119879

1are

considered as foreground pixels and therefore they are keptand colored in black The remaining pixels are left to thesecond pass in which they are locally binarized by combiningthe results of several local thresholding methods to select themost probable value

We detail in the following the processing steps

31 Estimation of Two Thresholds 1198791and 119879

2 The first step in

the binarization process is the calculation of the two thresh-olds119879

1and1198792 Since the thresholding purpose is to divide the

Advances in Multimedia 5

(a) Original image (b) Otsursquos thresholding result

124

Num

ber o

f pi

xels

Gray levels

(c) Corresponding histogram

Figure 2 Global thresholding of degraded image

image into two classes foreground and background and sincea single threshold is not able to accomplish this task the useof more separation thresholds seems be a perfect solution

These two thresholds are estimated from the gray levelshistogram of the original image and represent the averageintensity of the foreground and background respectively

To obtain these two thresholds we first compute a globalthreshold 119879 using a global thresholding algorithm which canbe Otsursquos algorithm Kapur or any other global algorithmIn our approach we opted for the Otsu algorithm becausethis technique has shown its efficiency and overcame otherglobal methods in several comparative studies [17 18] 119879separates the gray levels histogram of the image into twoclasses foreground and background

1198791and 119879

2are estimated from 119879 Noting 119889min the

minimum distance between the average intensity of theforeground is represented by the first half of the histogramand the average intensity of the background is represented bythe second half

1198791= 119879 minus

119889min2

1198792= 119879 +

119889min2

(9)

32 Global Image Thresholding Using 1198791and 119879

2 After the

estimation of the two thresholds 1198791and 119879

2 all pixels having

a gray level higher than 1198792are transformed into white which

eliminates most of the image background and those whosegray level is less than 119879

1are colored in black Note 119868 the

resulting image These pixels are certainly foreground pixelsThe resulting image still contains some noise but all theforeground information is preserved

33 Local Thresholding of the Remaining Pixels The pixelsunprocessed in the previous step (those with a gray levelbetween 119879

1and 119879

2) may be from the foreground and thus

must be preserved likewise they may be backgroundrsquos ornoisersquos pixels and should be removed The decision to assignthe remaining pixels to one of the two classes foreground orbackground is performed using a local process by examiningthe neighborhood of these pixels To guarantee amore correctclassification we propose to apply several local thresholdingmethods In our experiments we chose the following meth-ods Niblack Sauvola and Nick since these methods wereranked in first places in several previous comparative studies[2 19 20]

For each pixel (119909 119910) of 119868 not yet classified we calculatelocally its new binary values (0 for black and 1 for white)

6 Advances in Multimedia

obtained by applying Niblackrsquos Sauvolarsquos and Nickrsquos methodsand we obtain thus three temporary images 119868

1 1198682 and 119868

3

respectively Each one of the three local methods computesthe binary value of each remaining pixel (119909 119910) by

119868119894(119909 119910) =

0 if 119868 (119909 119910) lt LT119894(119909 119910)

255 otherwise

1 le 119894 le 3

(10)

with LT1 LT2 LT3being the local threshold computed using

Niblackrsquos Sauvolarsquos and Nickrsquos methods respectivelyThe final binary value 119868

119887(119909 119910) of each remaining pixel is

that resulted of at least two of the three methods

119868119887(119909 119910) =

0 if3

sum

119894=1

119868119894(119909 119910) lt 2

1 otherwise(11)

4 Experiments and Results

Experiments have been performed in order to estimate theperformance of our approach We applied the proposedtechnique over a large test set and compared the obtainedresults with well-known methods including global localand hybrid methods The comparison will estimate both thebinarization quality and the execution time

Firstly for parameterized methods a series of experi-ments have been performed in order to set their optimalparameter values Note PS(119898) the parameters set of a specificthresholding method 119898 For example PS(Niblack) = 119896 119908PS(Sauvola) = 119896 119908 119877 and so forth We try to find theoptimal values of PS(119898) giving the binarization results whichare closest to the ground truth images A specific range ofvalues [119886 119887] is first defined for each parameter To improvethe accuracy of the process we used a wide initial rangefor every parameter For Niblackrsquos method per example therange of the parameter 119908 is defined as [119886 119887] = [3 299] Afterthat we apply the binarization method 119898 with the differentvalues of PS(119898) from the predefined ranges on the test setdescribed in Section 41 We compare the binarization resultswith the ground truth images using the evaluation measuresdetailed in Section 42 A ranking of the obtained results isthen performed according to each measure separately Bycalculating the sum of all ranks we can infer the optimalset of parameter values as leading to the top ranks Theoptimal parameter values of the parameterized methods aresummarized in Table 1

41 Test Bases Two test sets have been used for the eval-uation of the proposed method The first is a public setcomposed of document images from the four collections pro-posed within the context of the competitions DIBCO 2009(httpusersiitdemokritosgrsimbgatDIBCO2009bench-mark) H-DIBCO 2010 (httpwwwiitdemokritosgrsimbgatH-DIBCO2010benchmark) DIBCO2011 (httputopiaduthgrsimipratikaDIBCO2011benchmark) and H-DIBCO2012 (httputopiaduthgrsimipratikaHDIBCO2012bench-

Table 1 Optimal parameters values of the parameterized methods

Binarization technique Optimal parameter valuesNiblack [11] 119896 = minus02 and 119908 = 35

Sauvola and Pietikainen [3] 119896 = 02 119877 = 128 and 119908 = 27

Nick 119896 = minus01 and 119908 = 19

Improved IGT 119899 = 50 and 119896 = 3

mark) These four collections contain a total of 50 realdocuments images (37 handwritten and 13 printed) comingfrom the collections of several libraries with the associatedground truth images All the images contain representativedegradations which appear frequently (eg variable back-ground intensity shadows smear smudge low contrastand bleed-through) Figure 3 shows some images from thesecollections

The second set of images is a synthetic collection com-posed of 150 synthetic images of documents constructedby the fusion of 15 different backgrounds and 10 binaryimages (Figure 4) The fusion is done by applying the imagemosaicing by superimposing technique for blending [21]Theidea is as follows we start with some images of documentsin black and white which represent the ground truth andwith some backgrounds extracted from old documents andwe apply a fusion procedure to get as many different imagesof old documents However Stathis et al in [22] proposedtwo different techniques for the blendingmaximum intensityand image averaging We adopt using the image averagingtechnique in order to have a more natural result

42 Evaluation Measures In addition to the execution timethe qualitative assessment is performed in terms of fivestandard evaluation measures used in DIBCO 2009 H-DIBCO 2010 DIBCO 2011 and H-DIBCO 2012 which are 119865-measure PSNR NRM MPM and DRD

Note TP TN FP and FN the true positive true negativefalse positive and false negative values respectively

421 119865-measure 119865-measure was introduced first by Chin-chor in [23]

119865-Measure = 2 times Recall times PrecisionRecall + Precision

(12)

where

Recall = TPTP + FN

Precision = TPTP + FP

(13)

422 PSNR PSNR is a similarity measure between twoimages However the higher the value of PSNR the higherthe similarity of the two images [24 25]

PSNR = 10 sdot log( 1198622

MSE) (14)

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Advances in Multimedia 5

(a) Original image (b) Otsursquos thresholding result

124

Num

ber o

f pi

xels

Gray levels

(c) Corresponding histogram

Figure 2 Global thresholding of degraded image

image into two classes foreground and background and sincea single threshold is not able to accomplish this task the useof more separation thresholds seems be a perfect solution

These two thresholds are estimated from the gray levelshistogram of the original image and represent the averageintensity of the foreground and background respectively

To obtain these two thresholds we first compute a globalthreshold 119879 using a global thresholding algorithm which canbe Otsursquos algorithm Kapur or any other global algorithmIn our approach we opted for the Otsu algorithm becausethis technique has shown its efficiency and overcame otherglobal methods in several comparative studies [17 18] 119879separates the gray levels histogram of the image into twoclasses foreground and background

1198791and 119879

2are estimated from 119879 Noting 119889min the

minimum distance between the average intensity of theforeground is represented by the first half of the histogramand the average intensity of the background is represented bythe second half

1198791= 119879 minus

119889min2

1198792= 119879 +

119889min2

(9)

32 Global Image Thresholding Using 1198791and 119879

2 After the

estimation of the two thresholds 1198791and 119879

2 all pixels having

a gray level higher than 1198792are transformed into white which

eliminates most of the image background and those whosegray level is less than 119879

1are colored in black Note 119868 the

resulting image These pixels are certainly foreground pixelsThe resulting image still contains some noise but all theforeground information is preserved

33 Local Thresholding of the Remaining Pixels The pixelsunprocessed in the previous step (those with a gray levelbetween 119879

1and 119879

2) may be from the foreground and thus

must be preserved likewise they may be backgroundrsquos ornoisersquos pixels and should be removed The decision to assignthe remaining pixels to one of the two classes foreground orbackground is performed using a local process by examiningthe neighborhood of these pixels To guarantee amore correctclassification we propose to apply several local thresholdingmethods In our experiments we chose the following meth-ods Niblack Sauvola and Nick since these methods wereranked in first places in several previous comparative studies[2 19 20]

For each pixel (119909 119910) of 119868 not yet classified we calculatelocally its new binary values (0 for black and 1 for white)

6 Advances in Multimedia

obtained by applying Niblackrsquos Sauvolarsquos and Nickrsquos methodsand we obtain thus three temporary images 119868

1 1198682 and 119868

3

respectively Each one of the three local methods computesthe binary value of each remaining pixel (119909 119910) by

119868119894(119909 119910) =

0 if 119868 (119909 119910) lt LT119894(119909 119910)

255 otherwise

1 le 119894 le 3

(10)

with LT1 LT2 LT3being the local threshold computed using

Niblackrsquos Sauvolarsquos and Nickrsquos methods respectivelyThe final binary value 119868

119887(119909 119910) of each remaining pixel is

that resulted of at least two of the three methods

119868119887(119909 119910) =

0 if3

sum

119894=1

119868119894(119909 119910) lt 2

1 otherwise(11)

4 Experiments and Results

Experiments have been performed in order to estimate theperformance of our approach We applied the proposedtechnique over a large test set and compared the obtainedresults with well-known methods including global localand hybrid methods The comparison will estimate both thebinarization quality and the execution time

Firstly for parameterized methods a series of experi-ments have been performed in order to set their optimalparameter values Note PS(119898) the parameters set of a specificthresholding method 119898 For example PS(Niblack) = 119896 119908PS(Sauvola) = 119896 119908 119877 and so forth We try to find theoptimal values of PS(119898) giving the binarization results whichare closest to the ground truth images A specific range ofvalues [119886 119887] is first defined for each parameter To improvethe accuracy of the process we used a wide initial rangefor every parameter For Niblackrsquos method per example therange of the parameter 119908 is defined as [119886 119887] = [3 299] Afterthat we apply the binarization method 119898 with the differentvalues of PS(119898) from the predefined ranges on the test setdescribed in Section 41 We compare the binarization resultswith the ground truth images using the evaluation measuresdetailed in Section 42 A ranking of the obtained results isthen performed according to each measure separately Bycalculating the sum of all ranks we can infer the optimalset of parameter values as leading to the top ranks Theoptimal parameter values of the parameterized methods aresummarized in Table 1

41 Test Bases Two test sets have been used for the eval-uation of the proposed method The first is a public setcomposed of document images from the four collections pro-posed within the context of the competitions DIBCO 2009(httpusersiitdemokritosgrsimbgatDIBCO2009bench-mark) H-DIBCO 2010 (httpwwwiitdemokritosgrsimbgatH-DIBCO2010benchmark) DIBCO2011 (httputopiaduthgrsimipratikaDIBCO2011benchmark) and H-DIBCO2012 (httputopiaduthgrsimipratikaHDIBCO2012bench-

Table 1 Optimal parameters values of the parameterized methods

Binarization technique Optimal parameter valuesNiblack [11] 119896 = minus02 and 119908 = 35

Sauvola and Pietikainen [3] 119896 = 02 119877 = 128 and 119908 = 27

Nick 119896 = minus01 and 119908 = 19

Improved IGT 119899 = 50 and 119896 = 3

mark) These four collections contain a total of 50 realdocuments images (37 handwritten and 13 printed) comingfrom the collections of several libraries with the associatedground truth images All the images contain representativedegradations which appear frequently (eg variable back-ground intensity shadows smear smudge low contrastand bleed-through) Figure 3 shows some images from thesecollections

The second set of images is a synthetic collection com-posed of 150 synthetic images of documents constructedby the fusion of 15 different backgrounds and 10 binaryimages (Figure 4) The fusion is done by applying the imagemosaicing by superimposing technique for blending [21]Theidea is as follows we start with some images of documentsin black and white which represent the ground truth andwith some backgrounds extracted from old documents andwe apply a fusion procedure to get as many different imagesof old documents However Stathis et al in [22] proposedtwo different techniques for the blendingmaximum intensityand image averaging We adopt using the image averagingtechnique in order to have a more natural result

42 Evaluation Measures In addition to the execution timethe qualitative assessment is performed in terms of fivestandard evaluation measures used in DIBCO 2009 H-DIBCO 2010 DIBCO 2011 and H-DIBCO 2012 which are 119865-measure PSNR NRM MPM and DRD

Note TP TN FP and FN the true positive true negativefalse positive and false negative values respectively

421 119865-measure 119865-measure was introduced first by Chin-chor in [23]

119865-Measure = 2 times Recall times PrecisionRecall + Precision

(12)

where

Recall = TPTP + FN

Precision = TPTP + FP

(13)

422 PSNR PSNR is a similarity measure between twoimages However the higher the value of PSNR the higherthe similarity of the two images [24 25]

PSNR = 10 sdot log( 1198622

MSE) (14)

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

6 Advances in Multimedia

obtained by applying Niblackrsquos Sauvolarsquos and Nickrsquos methodsand we obtain thus three temporary images 119868

1 1198682 and 119868

3

respectively Each one of the three local methods computesthe binary value of each remaining pixel (119909 119910) by

119868119894(119909 119910) =

0 if 119868 (119909 119910) lt LT119894(119909 119910)

255 otherwise

1 le 119894 le 3

(10)

with LT1 LT2 LT3being the local threshold computed using

Niblackrsquos Sauvolarsquos and Nickrsquos methods respectivelyThe final binary value 119868

119887(119909 119910) of each remaining pixel is

that resulted of at least two of the three methods

119868119887(119909 119910) =

0 if3

sum

119894=1

119868119894(119909 119910) lt 2

1 otherwise(11)

4 Experiments and Results

Experiments have been performed in order to estimate theperformance of our approach We applied the proposedtechnique over a large test set and compared the obtainedresults with well-known methods including global localand hybrid methods The comparison will estimate both thebinarization quality and the execution time

Firstly for parameterized methods a series of experi-ments have been performed in order to set their optimalparameter values Note PS(119898) the parameters set of a specificthresholding method 119898 For example PS(Niblack) = 119896 119908PS(Sauvola) = 119896 119908 119877 and so forth We try to find theoptimal values of PS(119898) giving the binarization results whichare closest to the ground truth images A specific range ofvalues [119886 119887] is first defined for each parameter To improvethe accuracy of the process we used a wide initial rangefor every parameter For Niblackrsquos method per example therange of the parameter 119908 is defined as [119886 119887] = [3 299] Afterthat we apply the binarization method 119898 with the differentvalues of PS(119898) from the predefined ranges on the test setdescribed in Section 41 We compare the binarization resultswith the ground truth images using the evaluation measuresdetailed in Section 42 A ranking of the obtained results isthen performed according to each measure separately Bycalculating the sum of all ranks we can infer the optimalset of parameter values as leading to the top ranks Theoptimal parameter values of the parameterized methods aresummarized in Table 1

41 Test Bases Two test sets have been used for the eval-uation of the proposed method The first is a public setcomposed of document images from the four collections pro-posed within the context of the competitions DIBCO 2009(httpusersiitdemokritosgrsimbgatDIBCO2009bench-mark) H-DIBCO 2010 (httpwwwiitdemokritosgrsimbgatH-DIBCO2010benchmark) DIBCO2011 (httputopiaduthgrsimipratikaDIBCO2011benchmark) and H-DIBCO2012 (httputopiaduthgrsimipratikaHDIBCO2012bench-

Table 1 Optimal parameters values of the parameterized methods

Binarization technique Optimal parameter valuesNiblack [11] 119896 = minus02 and 119908 = 35

Sauvola and Pietikainen [3] 119896 = 02 119877 = 128 and 119908 = 27

Nick 119896 = minus01 and 119908 = 19

Improved IGT 119899 = 50 and 119896 = 3

mark) These four collections contain a total of 50 realdocuments images (37 handwritten and 13 printed) comingfrom the collections of several libraries with the associatedground truth images All the images contain representativedegradations which appear frequently (eg variable back-ground intensity shadows smear smudge low contrastand bleed-through) Figure 3 shows some images from thesecollections

The second set of images is a synthetic collection com-posed of 150 synthetic images of documents constructedby the fusion of 15 different backgrounds and 10 binaryimages (Figure 4) The fusion is done by applying the imagemosaicing by superimposing technique for blending [21]Theidea is as follows we start with some images of documentsin black and white which represent the ground truth andwith some backgrounds extracted from old documents andwe apply a fusion procedure to get as many different imagesof old documents However Stathis et al in [22] proposedtwo different techniques for the blendingmaximum intensityand image averaging We adopt using the image averagingtechnique in order to have a more natural result

42 Evaluation Measures In addition to the execution timethe qualitative assessment is performed in terms of fivestandard evaluation measures used in DIBCO 2009 H-DIBCO 2010 DIBCO 2011 and H-DIBCO 2012 which are 119865-measure PSNR NRM MPM and DRD

Note TP TN FP and FN the true positive true negativefalse positive and false negative values respectively

421 119865-measure 119865-measure was introduced first by Chin-chor in [23]

119865-Measure = 2 times Recall times PrecisionRecall + Precision

(12)

where

Recall = TPTP + FN

Precision = TPTP + FP

(13)

422 PSNR PSNR is a similarity measure between twoimages However the higher the value of PSNR the higherthe similarity of the two images [24 25]

PSNR = 10 sdot log( 1198622

MSE) (14)

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Advances in Multimedia 7

(a) (b)

(c) (d)

Figure 3 Images taken from the collections of (a) DIBCO 2009 (b) H-DIBCO 2010 (c) DIBCO 2011 and (d) H-DIBCO 2012

(a) (b) (c)

Figure 4 Image of document obtained by fusing binary image and background (a) Background from old document (b) ground truth binaryimage and (c) the resulting synthetic image

where

MSE =

sum119872

119909=1sum119873

119910=1(119868 (119909 119910) minus 119868

2(119909 119910))

2

119872119873

(15)

119868 and 1198682represent the two images matched 119872 and 119873

are there height and width respectively 119862 is the differencebetween foreground and background

423 NRM (Negative Metric Rate) NRM is based on thepixelwise mismatches between the ground truth and the

binarized image [26] It combines the false negative rateNRFNand the false positive rate NRFP It is denoted as follows

NRM =NRFN +NRFP

2(16)

with

NRFN =FN

FN + TP NRFP =

FPFP + TN

(17)

Contrary to 119865-measure and PSNR The better binariza-tion quality is obtained for lower NRM value

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

8 Advances in Multimedia

Table 2 Average results obtained on the test dataset

Execution time (ms) 119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD

GlobalOtsu [6] 45875 80565 15716 883 2961 26208Kapur et al [8] 520828 81921 15692 8109 2059 23104IGT 309543 76556 14307 1011 2778 2873

LocalNiblack [11] 785041 6688 778 1147 18239 4342Sauvola and Pietikainen [3] 739142 8568 1862 795 1254 613Nick 724809 8222 19047 931 2201 527

Hybrid

Improved IGT 487943 78039 14728 1205 372 2278Gangamma and Srikanta [14] 73999157 69187 13469 2171 1543 3881Background Subtractionthresholding 1122057 68907 13432 2234 1775 4097

Tabatabaei and Bohlool [16] 75875 8582 2669 82 1183 3439Proposed Approche 141786 8744 2797 674 1025 3161

424 MPM (Misclassification PenaltyMetric) Themisclassi-fication penaltymetricMPMevaluates the binarization resultagainst the ground truth on an object-by-object basis [26]

MPM =MPFN +MPFP

2 (18)

where

MPFN =sum

FN119894=1

119889119894

FN119863

MPFP =sum

FP119895=1

119889119895

FP

119863

(19)

119889119894

FN and 119889119895FP denote the distance of the 119894th false negativeand the 119895th false positive pixel from the contour of the groundtruth segmentation The normalization factor 119863 is the sumover all the pixel-to-contour distances of the ground truthobject A low MPM score denotes that the algorithm is goodat identifying an objectrsquos boundary

425 DRD (Distance Reciprocal Distortion Metric) DRD isan objective distortion measure for binary document imagesand it was proposed by Lu et al in [25]Thismeasure properlycorrelates with the human visual perception and it measuresthe distortion for all the 119878 flipped pixels as follows

DRD =sum119878

119896=1DRD119896

NUBN (20)

NUBN is the number of the nonuniform 8 times 8 blocks in theGT image

DRD119896is the distortion of the 119896th flipped pixel of coordi-

nate (119909 119910) and it is calculated using a 5 times 5 normalized weightmatrix119882

119873119898 This last is defined in [25] as follows

119882119873119898

(119894 119895) =119882119898(119894 119895)

sum119898

119894=1sum119898

119895=1119882119898(119894 119895)

(21)

such that

119882119898(119894 119895) =

0 for 119894 = 119894119862 119895 = 119895

119862

1

radic(119894 minus 119894119862)2

+ (119895 minus 119895119862)2

otherwise

(22)

With119898 = 5 and 119894119862= 119895119862= (1 + 119898)2

DRD119896is given as follows

DRD119896=

2

sum

119894=minus2

2

sum

119895=minus2

1003816100381610038161003816119868GT (119909 + 119894 119910 + 119895) minus 119868119861 (119909 119910)1003816100381610038161003816

times 119882119873119898

(119894 + 2 119895 + 2)

(23)

43 Results andDiscussion Theaverage results obtained overthe test images are summarized in Table 2 The final rankingof the compared methods is shown in Table 3 which alsosummarizes the partial ranks of each method according toeach evaluation measure and the sum of ranks

From Tables 2 and 3 it is clear that our proposed methodis totally ranked first and has the best performances accordingto all measures of binarization quality It exceeded localmethods such as the famous Sauvola and Pietikainenmethodwhich ranked 3rd in our experiments and even other hybridtechniques Indeed the combination of three local threshold-ing techniques enabled a more robust determination of thebinary value of each pixel by taking the most likely value

Regarding the execution time our method is very fastcompared to local methods (about 52 times more thanSauvola and Pietikainenrsquos method) which enabled us to earnabout than 98 of the execution time This is logical becauseonly a portion of the pixels (having a gray level between thetwo thresholds 119879

1and 119879

2) is locally analyzed

5 Conclusion

In this paper we tackled the problem of foregroundback-ground separation from the images of historical documents

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

Advances in Multimedia 9

Table 3 Final ranking of the compared methods on the test set

Rank Method Execution time(ms)

119865-measure() PSNR NRM

(times10minus2)MPM(times10minus3) DRD Sum of ranks

1 Proposed Approche 6 1 1 1 1 1 112 Tabatabaei and Bohlool [16] 10 2 2 4 2 2 223 Sauvola and Pietikainen [3] 8 3 4 2 3 4 244 Kapur et al [8] 4 5 6 3 4 6 285 Nick 7 4 3 6 5 3 286 Otsu [6] 2 6 5 5 7 7 327 IGT 1 8 8 7 6 8 388 Improved IGT 3 7 7 9 8 5 399 Gangamma and Srikanta [14] 9 9 9 10 9 9 55

10 Thresholding by BackgroundSubtraction 5 10 10 11 10 10 56

11 Niblack [11] 11 11 11 8 11 11 63

We proposed a hybrid approach of degraded documentimages binarization The proposed approach runs on twopasses Firstly a global thresholding using Otsursquos algorithm isapplied on the entire image and two different thresholds aredetermined All pixels below the first threshold are preservedand all pixels higher than the second threshold are eliminatedas they represent surely background pixels The remainingpixels are then processed locally based on their neighborhoodinformation In this step three local thresholding methodsare combined in order to obtain a more accurate decisionSince the number of pixels processed locally is very smallcompared to the total number of pixels the time required forthe binarization is reduced considerably without decreasingthe performances To validate our approach we comparedit with the state-of-the-art methods from the literature andthe obtained results on standard and synthetic collections areencouraging and confirm our approach

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

References

[1] N Arica and F T Yarman-Vural ldquoAn overview of characterrecognition focused on off-line handwritingrdquo IEEE Transac-tions on Systems Man and Cybernetics Part C Applications andReviews vol 31 no 2 pp 216ndash233 2001

[2] K Khurshid I Siddiqi C Faure and N Vincent ldquoComparisonof Niblack inspired binarization methods for ancient docu-mentsrdquo in 16th International conference on Document Recogni-tion and Retrieval Proceedings of SPIE San Jose Calif USAJanuary 2009

[3] J Sauvola and M Pietikainen ldquoAdaptive document imagebinarizationrdquo Pattern Recognition vol 33 no 2 pp 225ndash2362000

[4] B Fernando and S Karaoglu ldquoExtreme value theory based textbinarization in documents and natural scenesrdquo in Proceedings of

the 3rd IEEE International Conference onMachineVision (ICMVrsquo10) pp 144ndash151 Hong Kong December 2010

[5] M Sezgin and B Sankur ldquoSurvey over image thresholdingtechniques and quantitative performance evaluationrdquo Journal ofElectronic Imaging vol 13 no 1 pp 146ndash168 2004

[6] N Otsu ldquoA threshold selection method from gray-level his-togramsrdquo IEEE Transactions on Systems Man and Cyberneticsvol 9 no 1 pp 62ndash66 1979

[7] F R D Velasco ldquoThresholding using the ISODATA clusteringalgorithmrdquo IEEE Transactions on Systems Man and Cybernet-ics vol 10 no 11 pp 771ndash774 1980

[8] J N Kapur P K Sahoo and A K C Wong ldquoA new methodfor gray-level picture thresholding using the entropy of thehistogramrdquo Computer Vision Graphics and Image Processingvol 29 no 3 pp 273ndash285 1985

[9] E Kavallieratou ldquoA binarization algorithm specialized on docu-ment images and photosrdquo in Proceedings of the 8th InternationalConference onDocument Analysis and Recognition pp 463ndash467September 2005

[10] J Bernsen ldquoDynamic thresholding of grey-level imagesrdquo inProceedings of the 8th International Conference on PatternRecognition pp 1251ndash1255 Paris France 1986

[11] WNiblackAn Introduction toDigital Image Processing PrenticeHall Englewood Cliffs NJ USA 1986

[12] T Sari A Kefali and H Bahi ldquoAn MLP for binarizing imagesof old manuscriptsrdquo in Proceedings of the 13th InternationalConference on Frontiers in Handwriting Recognition (ICFHRrsquo12) pp 247ndash251 Bari Italy September 2012

[13] E Kavallieratou and S Stathis ldquoAdaptive binarization of histor-ical document imagesrdquo in Proceedings of the 18th InternationalConference on Pattern Recognition (ICPR rsquo06) vol 3 pp 742ndash745 Hong Kong August 2006

[14] B Gangamma and M K Srikanta ldquoEnhancement of degradedhistorical kannada documentsrdquo International Journal of Com-puter Applications vol 29 no 11 pp 1ndash6 2011

[15] G Leedham C Yan K Takru J H N Tan and LMian ldquoCom-parison of Some thresholding algorithms for textbackgroundsgmentation in difficult document imagesrdquo in Proceedings ofthe 7th International Conference on Document Analysis andRecognition pp 859ndash864 2003

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

10 Advances in Multimedia

[16] S A Tabatabaei and M Bohlool ldquoA novel method for bina-rization of badly illuminated document imagesrdquo in Proceedingsof the 17th IEEE International Conference on Image Processing(ICIP rsquo10) pp 3573ndash3576 Hong Kong China September 2010

[17] P K Sahoo S Soltani and A K C Wong ldquoA survey ofthresholding techniquesrdquoComputer Vision Graphics and ImageProcessing vol 41 no 2 pp 233ndash260 1988

[18] M A Ramırez-Ortegon and R Rojas ldquoUnsupervised eval-uation methods based on local gray-intensity variances forbinarization of historical documentsrdquo in Proceedings of the 20thInternational Conference on Pattern Recognition (ICPR rsquo10) pp2029ndash2032 IEEE Computer Society Istanbul Turkey August2010

[19] A J O Trier ldquoGoal-directed evaluation of binarization meth-odsrdquo IEEE Transactions on Pattern Analysis and MachineIntelligence vol 17 no 12 pp 1191ndash1201 1995

[20] A Kefali T Sari and M Sellami ldquoEvaluation of severalbinarization techniques for old Arabic documents imagesrdquoin Proceedings of International Symposium on Modelling andImplementation of Complex Systems Constantine Algeria 2010

[21] L G Brown ldquoA survey of image registration techniquesrdquo ACMComputing Surveys vol 24 no 4 pp 325ndash376 1992

[22] P Stathis E Kavallieratou and N Papamarkos ldquoAn evaluationsurvey of binarization algorithms on historical documentsrdquoin Proceedings of the 19th International Conference on PatternRecognition pp 742ndash745 December 2008

[23] N Chinchor ldquoMUC-4 evaluation metricsrdquo in Proceedings of the4th Message Understanding Conference pp 22ndash29 1992

[24] B Gatos K Ntirogiannis and I Pratikakis ldquoDIBCO 2009document image binarization contestrdquo International Journal onDocument Analysis and Recognition vol 14 no 1 pp 35ndash442011

[25] H Lu A C Kot and Y Q Shi ldquoDistance-reciprocal distortionmeasure for binary document imagesrdquo IEEE Signal ProcessingLetters vol 11 no 2 pp 228ndash231 2004

[26] J Aguilera H Wildenauer M Kampel M Borg D Thirdeand J Ferryman ldquoEvaluation of motion segmentation qualityfor aircraft activity surveillancerdquo in Proceedings of the 2ndJoint IEEE International Workshop on Visual Surveillance andPerformance Evaluation of Tracking and Surveillance (VS-PETSrsquo05) pp 293ndash300 October 2005

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of

International Journal of

AerospaceEngineeringHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

RoboticsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Active and Passive Electronic Components

Control Scienceand Engineering

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

RotatingMachinery

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporation httpwwwhindawicom

Journal ofEngineeringVolume 2014

Submit your manuscripts athttpwwwhindawicom

VLSI Design

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Shock and Vibration

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Civil EngineeringAdvances in

Acoustics and VibrationAdvances in

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Electrical and Computer Engineering

Journal of

Advances inOptoElectronics

Hindawi Publishing Corporation httpwwwhindawicom

Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

SensorsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Modelling amp Simulation in EngineeringHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Chemical EngineeringInternational Journal of Antennas and

Propagation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Navigation and Observation

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

DistributedSensor Networks

International Journal of