bari-th 391/00 ir

16

Upload: others

Post on 10-Dec-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

BARI-TH 391/00

Chaotic neural networks clustering: application to anti-personnel

mines detection by dynamical IR imaging

L. Angelini, F. De Carlo, M. Mannarelli, C. Marangi, G. Nardulli, M. Pellicoro,

S. Stramaglia

Dipartimento Interateneo di Fisica

Istituto Nazionale di Fisica Nucleare, Sezione di Bari

via Amendola 173, 70126, Bari, Italy

(July 2000)

Abstract

Nonparametric approaches to clustering may represent the optimal strategy

when there is no a priori knowledge about data structures. A method has been

recently proposed that provides a hierarchical solution to the clustering prob-

lem under very general assumptions, relying on the cooperative behavior of a

chaotic neural network where neurons update is performed by logistic maps.

The method was successfully tested on arti�cial and real-life problems: here

we report the results of an application to detection of buried anti-personnel

mines by dynamic thermography. Dynamic thermography consists in pro-

cessing temporal sequences of infrared images taken from the same scene,

submitted to either arti�cial or natural temperature variations. The aim is

to obtain an image segmentation where mine and soil can be discriminated

due to di�erent time evolution of their thermal properties. The approach

here proposed allows to get the correct classi�cation by analysing very short

image sequences, thus allowing a fast acquisition time. The e�ectiveness of

1

the method is demonstrated on image sequences of plastic AP mines taken

from realistic mine�elds.

PACS: 42.30.s ; 84.35 ; 05.45.G

Keyword: Mines detection, dynamic thermography, chaotic neural networks,

clustering

I. INTRODUCTION

Considerable e�orts are being made on a world-wide basis to develop a de�nitive solu-

tion to the landmine detection for humanitarian applications. As it is well known civilian

demining is a demanding task requiring approximately 100% of eÆciency. Even if recently

developed techniques (e.g. Nuclear Quadrupole Resonance, [1]) seems to give very promis-

ing results, so far a de�nitive solution to the problem is still to be found. Due to the large

number of variables (shape, explosive contents, soil conditions) to be taken into account

the most reliable way to approximate such high eÆciencies is through combination of dif-

ferent sensors and data analysis techniques [2]. In the present paper we focus on dynamic

thermography by infrared (IR) imagers, which measure the time evolution of surface tem-

perature di�erences due to thermal properties of the buried landmines and the surrounding

soil. Dynamic termography consists in processing image sequences which have been gener-

ated from the same scene, submitted to either naturally or arti�cially induced temperature

changes. Corresponding pixels in the image sequence will lie on a temperature curve which

may vary according to the thermal properties of the object they are representing in the

scene. The whole sequence must be processed to get a segmentation of the original scene

and a consequent classi�cation of the objects in it (mine, soil, false alarms).

Here we describe an approach to mine detection from IR images which exploits a recently

proposed clustering algorithm [3] based on coupled chaotic maps. The main advantage of the

proposed methodology is that it works on a small sequence of IR images taken at 4 seconds

time intervals, thus allowing a fast acquisition time as compared to previous approaches

2

which require 24 hours. The paper is organized as follows. In the next section after a brief

introduction to the clustering problem, the chaotic maps algorithm is reviewed. In section

3, the data set on which we perform our analysis is described. Our approach to the mine

detection and the results are described in section 4. Some conclusions are drawn in Section

5.

II. CLUSTERING PROBLEM

Clustering is an important technique in exploratory analysis of information or data: it

consists of partitioning N given points into K groups (clusters) so that two points belonging

to the same group are, in some sense, more akin than other two in di�erent groups [4]; it has

applications in several �elds such as pattern recognition [5], learning [6], astrophysics [7],

biology [8], economy [9], analysis of spatiotemporal signals [10] and more. This problem is

inherently ill-posed [11], i.e. any data set can be clustered in drastically di�erent ways, with

no absolute criterion to prefer one clustering over another. The most important sources of

ambiguity are the choice of the number of clusters and the fact that a satisfactory clustering

of the data depends on the desired resolution.

In parametric approaches [12] the data are viewed as coming from a mixture of prob-

ability distributions, each representing a di�erent cluster: prior knowledge of the clusters'

structure is used to choose the parametric form of distributions. In many cases of interest,

however, there is no a priori knowledge about the data structure. Nonparametric approaches,

which make fewer assumptions about the model, are mandatory in these cases [4]. A deeper

comprehension of nonparametric clustering has been achieved by works of Domany and

coworkers [13], who proposed to change the similarity index of the problem from the inter-

point distance to a many body similarity measure, i.e. the spin-spin correlation function of

a statistical spin model living on the lattice constituted by data-points. In their algorithm,

called superparamagnetic clustering (SPC), the temperature of the statistical model controls

the resolution at which the data are clustered.

3

The method proposed in [3] employs the mutual information of a system of coupled

chaotic maps as the many body similarity measure. Since a chaotic map lattice may be

seen as a particular realization of a neural network, in the following we will describe this

algorithm in the frame of neural networks [14].

Given a set of N points frig in a D-dimensional space, a real neuronic variable

xi 2 [�1; 1] is assigned to each point and synaptic strengths decreasing with distance

Jij = exp (�[ri � rj]2=2a2) , where a is the local length scale [15], are de�ned [16] . The

synchronous time evolution law for neurons is given by:

xi(t+ 1) = f

0@ 1

Ci

Xj 6=i

Jijxj(t)

1A ; (1)

where Ci =P

j 6=i Jij, and the transfer function is the logistic map f(x) = 1�2x2 (examples of

complex dynamics arising from the choice of a nonmonotonic transfer function are described

in [17]).

The stationary regime of Eqs.(1) corresponds to a macroscopic attractor which is inde-

pendent of the initial conditions. To study the correlation properties of the system, the

mutual information Iij [18], between neurons xi and xj, can be evaluated and it turns out

that the macroscopic attractor is characterized by long-range correlations [3]. The algo-

rithm identi�es clusters with the linked components of the graph one obtains drawing a link

between all the pairs of neurons whose mutual information exceeds a threshold �. Since

long-range correlations are present, all the scales, in the data set, contribute to the mutual

information pattern fIijg and the threshold � controls the resolution at which the data are

clustered. Accordingly, hierarchical clustering of data is achieved by successive thresholding

of the mutual information as described in [3]. We note that in a recent paper [19] randomly

coupled chaotic dynamical networks were studied and it was pointed out that the architec-

ture of the network biases the formation of almost synchronized clusters of maps. In the

neural system here considered the formation of clusters of almost synchronized neurons is

biased by the pattern of the synaptic couplings. It is interesting to compare this approach

to clustering with a popular neural network model for the same problem, i.e. the Self Or-

4

ganizing Feature Map (SOM) [20]. SOM maps input data points on its output layer: each

cluster is represented by an output neuron and its correspondent prototype vector (connec-

tions) that is determined by the training stage. After the training stage, each data-point is

assigned to the cluster corresponding to the nearest prototype vector. It is then clear that

SOM implicitly assumes that each cluster can be represented by a prototype, whereas the

algorithm, here reviewed, makes no a priori assumptions about the geometry of clusters. In

the chaotic neural model, the trajectories of neurons are chaotic and do not converge to an

asymptotic �xed point: the state of the neuron is meaningless in this context. Rather, the

information lies in the degree of synchronization of the chaotic trajectories of neurons, which

determine whether two neurons belong to the same class or not. It is worthy of further in-

vestigation the application of this perspective on chaotic neural networks to other functions

such as supervised classi�cation and associative memory.

Let us now illustrate the behavior of the algorithm on a toy model which can be con-

sidered as a mean-�eld version of the two-clusters problem. Since many of the diÆcult

clustering problems are of extremely high dimensionality, a mean-�eld setting of the fol-

lowing type is relevant to the problem. Consider two sets of N neurons and let J be the

synaptic coupling between pairs of neurons in di�erent groups, while pairs of neurons in the

same group are assumed to be connected by unit coupling. Depending on the value of J

and the threshold �, three kinds of output are possible for the algorithm: 1) The algorithm

identi�es a single big cluster, i.e. it does not resolve the two groups. 2) The algorithm

correctly identi�es two clusters corresponding to the two groups, 3) The algorithm identi�es

more than two clusters, i.e. at least one of the two groups is not recognized as a single

cluster. By numerical analysis we constructed the diagram in Fig. 1, where the occurrence

of the above described behaviors, in the � � J plane, is shown. For J greater than 0:15 the

algorithm always recognizes a single cluster: this is a consequence of the mean-�eld avour

of this toy model which favours the sinchronization of neurons. The line separating regions

1 and 2 becomes indistinguishable from the J axis at J � 0:06. The stability, with respect

to �, is a clear indication of the optimal partition: the two-clusters solution appears stable

5

for a wide range of J values.

III. DATA SET

The sequences of images that have been processed are relative to plastic anti-personnel

(AP) mines (PMN) and are part of a larger data set which has been made accessible by the

courtesy of the Royal Military Academy of Belgium and the Hudem Project. The original

data set has been produced by infrared imaging on realistic mine �elds at Meerdaal, a

Belgian military base. Four identical �elds of 9m�5m have been reproduced on di�erent

soil types: sand, gravel, local ground and a mixture of the �rst three [21]. On each �eld two

AP mine types, M35BG and PMN, are regularly spaced on a grid together with false alarm

objects (can, stones, plastic bottles). Some PMN mines are buried at increasing depth,

starting from 5 cm below the surface. The data set at our disposal consists of a sequence

of images showing an area of interest of 1m�1m containing a buried mine as well as false

alarm objects. The IR image sequence is generated by following a 24 hour temperature cycle

of a mine laying on the ground surface or buried at di�erent depths into the soil. Images

are taken in groups of ten consecutive shots (the time interval separating two shots is 4 s),

repeated every 30 minutes for the whole day. The average temperature di�erence (DT)

between mines and soil has been measured and recorded for each group of images. Here we

are concerned with image sequences of a PMN mine buried just below the surface at 5cm

depth, either in a sand or in a gravel soil. In both cases the data set has been generated by

using a IR camera operating in the spectral band of 3 to 5 micron. A sample of four images,

taken at di�erent times during the day, is shown in �g. 2. In this case, relative to a PMN

mine buried in a sand soil, the mine is located in the low left corner of each image. As it is

clear from the image sequence sample, the AP mines can be seen only on a few images of

the whole sequence and even in this case the distinction between mine and background is

unclear.

6

IV. DATA ANALYSIS

Previous approaches to dynamical IR imaging for mine detection make use of the whole

24 hours image sequence from which relevant information can be extracted e.g. through

principal component analysis [22]. However this is often unnecessary since we observe that

most of useful information can be extracted by short image sequences on a time scale of

a few minutes, provided that the average temperature di�erence between mine and soil is

signi�cant. We then consider groups of 10 consecutive images corresponding to measured

DT not smaller than 0.3 ÆC. Moreover, it can be noticed that on the short time scale we are

considering (i.e. the time interval of ten consecutive shots) the variance in the time domain,

calculated on corresponding pixels of the image series, is smaller for pixels in the mine

region than for ones belonging to the soil region. This suggests the choice of temporal mean

and variance as relevant features for data representation. Our approach associates to each

pixel a two dimensional feature vector whose components correspond to mean and variance

calculated over the ten images; a clustering stage is then performed in the two dimensional

feature space so as to group pixels characterized by similar thermal properties. The stable

partition of pixels, depicted in the original frame as a segmented image, is expected to

provide clusters corresponding to soil and mines respectively.

The approach here proposed can be summarized in the following three steps:

- features extraction: mean and variance in the time domain are extracted from the selected

image sequence; the two features are then linearly transformed to new variables with

zero mean and unit variance;

- clustering: chaotic neural network clustering is performed to get a partition of the feature

space; the clustering solution corresponding to the most stable partition is selected;

- segmentation: clustering results are represented in the original image space leading to

segmentation in two regions, mine and soil, respectively.

7

We applied this methodology to a data set of N=90�90 points, where each point corresponds

to a pixel in a 90�90 image obtained by compression of the original image series. The size of

the original data set has been reduced by image compression (i.e. by retaining the median

pixel in a 3�3 window on each original image of size 270�270) in order to save memory

and speed up the clustering algorithm run. The additional step of compression is obviously

unnecessary and even to be avoided for image sequences of extremely bad quality. In �g. 3a

the feature space of a series of ten consecutive images, corresponding to a PMN buried mine

in sand soil, is depicted. In �g. 3b the stable cluster partition we obtained is represented:

we notice the presence of a very big cluster containing the most part of the data points and

another relevant cluster (made of 217 points) in the region of high mean and low variance.

All the remaining data points form small clusters of size smaller than 20 points.

In order to show which regions of the original scene are described by the clustering

results, cluster labels for each data point are reported in an arbitrary grey level scale in the

original image space. Apart from some edge e�ects, as it is clear from �g. 4, the biggest

cluster corresponds to the background (soil) whereas the other cluster represents the mine.

For sake of clearness, clusters of size smaller than the two main ones are given the same grey

level than the background since they act as a noise e�ect on the �nal image.

The presence of the mine is marked here as that of a bright object with low variance in

a darker environment. Of course that is related to the positive high temperature di�erence

between mine and soil at the acquisition time of the data set considered (DT in the range

[0.9,1.0]). Similar results can be obtained for the acquisition time when temperature di�er-

ences are inverted in sign, e.g. during the night. We performed the same analysis on a PMN

mine buried in a gravel soil. Again, the mine is located at the low left corner of each image

and results are similar to the ones already obtained on sand soil. In Fig. 5 are shown the

results obtained for the \worst" case of low temperature di�erences measured during the

night (DT in the range [0.3,0.5]), together with the �rst image of the processed sequence.

The described approach has been successfully applied to the whole data set of images

taken from gravel and sand mine�elds, with measured temperature di�erences ranging from

8

0 to 1.0 ÆC. We observe that mine detection through chaotic network clustering can be

easily obtained for DT greater than 0.3 ÆC, even in the absence of any pre-processing of the

original image sequence. For lower values of temperature di�erences, i.e. when the content

of information displayed by IR images becomes less and less signi�cant, further steps of data

pre-processing together with introduction of additional features may be required.

V. CONCLUSION

In this paper we have presented an application of a recently proposed chaotic network

clustering to image segmentation of IR image sequences for landmines detection. The ap-

proach consists in performing clustering in a suitable feature space extracted from an IR

image sequence and a subsequent segmentation in the original image space. We have shown

that the proposed methodology is e�ective on small sequences of IR images taken at 4 sec-

onds time intervals, provided that the average temperature di�erence between mine and

soil is at least 0:3 ÆC, thus requiring a very short acquisition time. The approach has been

tested on a data set of real images of plastic buried AP mines. The integration of the present

methodology with other sources of information is matter for further work.

ACKOWLEDGEMENTS

We thank Marc Acheroy and the RMA(SIC) Hudem project for providing the Meerdael

data set and Bart Scheers for all the useful information about the same data set. Discussions

with Palma Blonda, Guido Pasquariello and Giuseppe Satalino are warmly acknowledged.

9

REFERENCES

[1] News Release

of Defence Advanced Research Projects Agency, USA, February 21, 2000. Program

details on http://www.darpa.mil/ato/programs/uxo/programs/qm/Default.htm.

[2] EUREL International Conference on \The detection of abandoned land mines", Venue

EICC, Edinburgh, UK, 1996.

[3] L. Angelini, F. De Carlo, C. Marangi, M. Pellicoro, S. Stramaglia, Phys. Rev. Lett. 85,

(2000) 554.

[4] See, e.g., K. Fukunaga, Introduction to Statistical Pattern Recognition, Academic Press,

San Diego, 1990.

[5] R.O. Duda and P.E. Hart, Pattern Classi�cation and Scene Analysis Wiley, New York,

1973.

[6] J. Moody and C.J. Darken, Neural Computation 1, (1989) 281.

[7] A. Dekel and M.J. West, Astrophys. J. 288, (1985) 411.

[8] G. Getz, E. Levine, E. Domany, M. Q. Zhang, Physica A 279, (2000) 457.

[9] See, e.g., M. Marsili, preprint cond-mat/0003241, and references therein.

[10] A. Hutt, M. Svensen, F. Kruggel, R. Friedrich, Phys. Rev. E 61, (2000) R4691.

[11] A mathematical problem is well posed if a unique solution exists that depends con-

tinuously on the data (J. Hadamard, Sur les probl�emes aux d�eriv�ees partielles et leur

signi�cation physique, Princeton University Bullettin, Princeton, 1902, p. 13); the clus-

tering problem violates the uniqueness requirement.

[12] See, e.g., K. Rose, E. Gurewitz, G.C. Fox, Phys. Rev. Lett. 65, (1990) 945; N. Barkai,

H. Sompolinski, Phys. Rev. E 50, (1994) 1766.

10

[13] M. Blatt, S. Wiseman, E. Domany, Phys. Rev. Lett. 76, (1996) 3251; Neural Computa-

tion 9, (1997) 1805; E. Domany, Physica A 263, (1999) 158.

[14] For a recent review, see A.C.C. Coolen Statistical Mechanics of Recurrent Neural Net-

works , preprints cond-mat/0006010, 0006011.

[15] The local length scale a can be determined in many di�erent ways, for example as the

average distance between k-nearest neighboring sites, two points i and j being k-nearest

neighhboring if j is one of the k nearest points of i and i is one of the k nearest points

of j. The partition provided by the clustering algorithm is quite insensitive to the value

of a.

[16] Obviously, di�erent de�nitions of distance between pairs of data points are possible, the

Euclidean being the most common one.

[17] D. Boll�e, B. Vinck, Physica A 223, (1996) 293; D. Caroppo, S. Stramaglia, Phys. Lett.

A 246, (1998) 55; D. Caroppo, M. Mannarelli, G. Nardulli, S. Stramaglia, Phys. Rev.

E 60, (1999) 2186.

[18] The mutual information between neurons xi and xj is evaluated as follows. In the

stationary regime a �nite length binary string can be associated to each neuron by

recording its trajectory for a suÆciently long time interval and assigning 0 (1) if xi(t) is

less (greater) than zero. The mutual information is then given by Iij = Hi +Hj �Hij,

where Hi (Hj) is the Boltzmann entropy of the i � th (j � th) string, and Hij is the

joint entropy of the two strings. If neurons xi and xj evolve independently then Iij = 0;

if the two neurons are exactly synchronized then the mutual information achieves its

maximum value, i.e. ln 2.

[19] S.C. Manrubia, A.S. Mikhailov, Phys. Rev. E 60, (1999) 1579.

[20] T. Kohonen, Self-Organizing Maps, Springer Verlag, Berlin, 1995.

[21] B. Scheers, M. Piette, A. Vander Vorst, Proceedings of IEE Second International Con-

11

ference on the Detection of Abandoned Landmines (MD'98), 12-14 October 1998, Ed-

inburgh, UK.

[22] M. Schachne, L. van Kempen, D. Milojevic, H. Sahli, Ph. Van Ham, M. Acheroy, J.

Cornelis , Proceedings of IEE Second International Conference on the Detection of

Abandoned Landmines (MD'98), 12-14 October 1998, Edinburgh, UK, 124 (1998)

12

FIGURES

0

0.05

0.1

0.15

0.2

0.25

0.3

0 0.1 0.2 0.3 0.4 0.5 0.6 θ

J

1

2

3

FIG. 1. The output from chaotic network clustering on the two cluster problem is represented

in the � � J plane.

13

FIG. 2. A sample of four IR images of a buried PMN mine taken at di�erent time during a 24h

cycle. The mine is located in the low left corner of each image.

14

-3

-2

-1

0

1

2

3

-2 0 2

8100σ

µ

~

~

(a)

-3

-2

-1

0

1

2

3

-2 0 2

7512217

σ

µ

~

~

(b)FIG. 3. (a) Feature space extracted from a sequence of 10 images of size N=90 � 90. Each

point in the plot corresponds to a pixel in the original image space. Mean and standard deviation

in the time domain are linearly transformed to give the two displayed features ~� and ~� , with zero

mean and unit variance. (b) Clustering results are shown in the feature space. Two big cluster of

7512 points (background) and 217 points (mine) can be distinguished. Small clusters of size up to

20 points are given the same grey level than the background.

FIG. 4. Left: clustering results corresponding to Fig. 3 are reported in arbitrary grey scale

in the original image space. Right: the �rst image of the processed sequence is displayed for

comparison. The image shows a buried PMN mine in sand soil, located in the low left corner. The

images in the sequence are compressed and have size N=90�90.

15

FIG. 5. Left: clustering results are reported in arbitrary grey scale in the original image space.

Right: the �rst image of the processed sequence is displayed for comparison. The image shows a

buried PMN mine in gravel soil, located in the low left corner. The images in the sequence are

uncompressed and have size N=270�270.

16