neural nets wirn09 · gabriele colombini, davide sottara, luca luccarini and paola mello vii. ......
TRANSCRIPT
ISBN 978-1-60750-072-8ISSN 0922-6389
Frontiersin
ArtificialIntelligence
andApplications
NEU
RAL
NET
SW
IRN09
204
NEURAL NETSWIRN09
9 781607 500728
ISBN 978-1-60750-072-8
L.C. Jain
Knowledge-Based Intelligent
Engineering Systems Centre
SCT-Building
University of South Australia
Adelaide,
Mawson Lakes
SA 5095, Australia
About the book series FAIA-Knowledge-Based Intelligent Engineering Systems
(KBIES)
Editors: L.C. Jain and R.J. Howlett
Aim of the SeriesThe aim of the KBIES series is to report on the tremendous range of applications
arising out of investigations into intelligent systems, coupled with the latest generic
research that makes these applications possible. The series provides a leading
resource for researchers, engineers, managers and all others concerned with this
area of research, or wanting to know more about it.
Invitation to Propose a ContributionProposals are invited for text books, edited books, hand books and conference
proceedings. Proposals may be sent to the series editors:
For details please visit:
www.kesinternational.org/bookseries.php
www.iospress.com
R.J. Howlett
Director of Brighton KTP Centre
Head of Intelligent Signal Processing
Labs
School of Engineering
University of Brighton
Brighton
BN2 4GJ, United Kingdom
Bruno ApolloniSimone BassisCarlo F. Morabito
B. Apolloni, S. Bassis and C
.F. Morabito (Eds.)
Proceedings of the 19th Italian Workshopon Neural Nets, Vetri Sul Mare, Salerno, ItalyMay 28-30-2009
faia 204 apolloni:faia204 apollini 06-11-09 12:01 Pagina 1
NEURAL NETS WIRN09
Frontiers in Artificial Intelligence and
Applications
Volume 204
Published in the subseries
Knowledge-Based Intelligent Engineering Systems
Editors: L.C. Jain and R.J. Howlett
Recently published in KBIES:
Vol. 203. M. Džbor, Design Problems, Frames and Innovative Solutions
Vol. 196. F. Masulli, A. Micheli and A. Sperduti (Eds.), Computational Intelligence and
Bioengineering – Essays in Memory of Antonina Starita
Vol. 193. B. Apolloni, S. Bassis and M. Marinaro (Eds.), New Directions in Neural Networks –
18th Italian Workshop on Neural Networks: WIRN 2008
Vol. 186. G. Lambert-Torres et al. (Eds.), Advances in Technological Applications of Logical
and Intelligent Systems – Selected Papers from the Sixth Congress on Logic Applied
to Technology
Vol. 180. M. Virvou and T. Nakamura (Eds.), Knowledge-Based Software Engineering –
Proceedings of the Eighth Joint Conference on Knowledge-Based Software
Engineering
Vol. 170. J.D. Velásquez and V. Palade, Adaptive Web Sites – A Knowledge Extraction from
Web Data Approach
Vol. 149. X.F. Zha and R.J. Howlett (Eds.), Integrated Intelligent Systems for Engineering
Design
Recently published in FAIA:
Vol. 202. S. Sandri, M. Sànchez-Marrè and U. Cortés (Eds.), Artificial Intelligence Research
and Development – Proceedings of the 12th International Conference of the Catalan
Association for Artificial Intelligence
Vol. 201. J.E. Agudo et al. (Eds.), Techniques and Applications for Mobile Commerce –
Proceedings of TAMoCo 2009
Vol. 200. V. Dimitrova et al. (Eds.), Artificial Intelligence in Education – Building Learning
Systems that Care: From Knowledge Representation to Affective Modelling
Vol. 199. H. Fujita and V. Mařík (Eds.), New Trends in Software Methodologies, Tools and
Techniques – Proceedings of the Eighth SoMeT_09
Vol. 198. R. Ferrario and A. Oltramari (Eds.), Formal Ontologies Meet Industry
Vol. 197. R. Hoekstra, Ontology Representation – Design Patterns and Ontologies that Make
Sense
ISSN 0922-6389
Neural Nets WIRN09
Proceedings of the 19th Italian Workshop on Neural Nets,
Vietri sul Mare, Salerno, Italy, May 28–30 2009
Edited by
Bruno Apolloni
Università degli Studi di Milano, Dipartimento di Scienze dell’Informazione,
Via Comelico 39, 20135 Milano, Italy
Simone Bassis
Università degli Studi di Milano, Dipartimento di Scienze dell’Informazione,
Via Comelico 39, 20135 Milano, Italy
and
Carlo F. Morabito
Università di Reggio Calabria, IMET, Loc. Feo di Vito,
89128 Reggio Calabria, Italy
Amsterdam • Berlin • Tokyo • Washington, DC
© 2009 The authors and IOS Press.
All rights reserved. No part of this book may be reproduced, stored in a retrieval system,
or transmitted, in any form or by any means, without prior written permission from the publisher.
ISBN 978-1-60750-072-8
Library of Congress Control Number: not yet known
Publisher
IOS Press BV
Nieuwe Hemweg 6B
1013 BG Amsterdam
Netherlands
fax: +31 20 687 0019
e-mail: [email protected]
Distributor in the USA and Canada
IOS Press, Inc.
4502 Rachael Manor Drive
Fairfax, VA 22032
USA
fax: +1 703 323 3668
e-mail: [email protected]
LEGAL NOTICE
The publisher is not responsible for the use which might be made of the following information.
PRINTED IN THE NETHERLANDS
Preface
Human Beings leave, the Science continues.
This volume collects contributions to the 19th Italian Workshop of the Italian So-
ciety for Neural Network (SIREN). The conference held a few days after the death of
prof. Maria Marinaro, who was a founder and a solid leader of the society. The confer-
ence was sad for this, but more intense at the same time. With neural networks we are
exploring thought mechanisms that share the two features of an efficient computational
tool and a representative of the physics of our brain, having the loops of our thoughts as
an ultimate product. It is not a duty of our discipline sentencing what happens when
these loops stop, but is a fascinating goal shedding light on how these loops run and
which tracks they leave.
The Science continues, and we dedicate these selected papers to Maria. We have
grouped them within the five themes of : “modeling”, “signal processing” , “medicine
application”, “economy”, and “general applications”. They come from three regular
sessions of the conference plus two specific workshops on “Computational Intelligence
for Economics and Finance” and “COST 2102: Cross Modal Analysis of Verbal and
Nonverbal Communications”, respectively. The editors would like to thank the invited
speakers as well as all those who contributed to the success of the workshops with pa-
pers of outstanding quality. Finally, special thanks go to the referees for their valuable
input.
Neural Nets WIRN09B. Apolloni et al. (Eds.)IOS Press, 2009© 2009 The authors and IOS Press. All rights reserved.
v
Contents
Preface v
Chapter 1. Models
The Discriminating Power of Random Features 3
Stefano Rovetta, Francesco Masulli and Maurizio Filippone
The Influence of Noise on the Dynamics of Random Boolean Network 11
A. Barbieri, M. Villani, R. Serra, S.A. Kauffman and A. Colacci
Toward a Space-Time Mobility Model for Social Communities 19
Bruno Apolloni, Simone Bassis and Lorenzo Valerio
Notes on Cutset Conditioning on Factor Graphs with Cycles 29
Francesco Palmieri
Neural Networks and Metabolic Networks: Fault Tolerance
and Robustness Features 39
Vincenzo Conti, Barbara Lanza, Salvatore Vitabile and Filippo Sorbello
Chapter 2. Signal Processing
The COST 2102 Italian Audio and Video Emotional Database 51
Anna Esposito, Maria Teresa Riviello and Giuseppe Di Maio
Face Verification Based on DCT Templates with Pseudo-Random Permutations 62
Marco Grassi and Marcos Faundez-Zanuy
A Real-Time Speech-Interfaced System for Group Conversation Modeling 70
Cesare Rocchi, Emanuele Principi, Simone Cifani, Rudy Rotili,
Stefano Squartini and Francesco Piazza
A Partitioned Frequency Block Algorithm for Blind Separation
in Reverberant Environments 81
Michele Scarpiniti, Andrea Picaro, Raffaele Parisi and Aurelio Uncini
Transcription of Polyphonic Piano Music by Means of Memory-Based
Classification Method 91
Giovanni Costantini, Massimiliano Todisco and Renzo Perfetti
A 3D Neural Model for Video Analysis 101
Lucia Maddalena and Alfredo Petrosino
A Wavelet Based Heuristic to Dimension Neural Networks for Simple
Signal Approximation 110
Gabriele Colombini, Davide Sottara, Luca Luccarini and Paola Mello
vii
Support Vector Machines and MLP for Automatic Classification
of Seismic Signals at Stromboli Volcano 116
Ferdinando Giacco, Antonietta Maria Esposito, Silvia Scarpetta,
Flora Giudicepietro and Maria Marinaro
Chapter 3. Economy and Complexity
Thoughts on the Crisis from a Scientific Perspective 127
Jaime Gil-Aluja
Aggregation of Opinions in Multi Person Multi Attribute Decision Problems
with Judgments Inconsistency 136
Silvio Giove and Marco Corazza
Portfolio Management with Minimum Guarantees: Some Modeling
and Optimization Issues 146
Diana Barro and Elio Canestrelli
The Treatment of Fuzzy and Specific Information Provided by Experts
for Decision Making in the Selection of Workers 154
Jaime Gil-Lafuente
An Intelligent Agent to Support City Policies Decisions 163
Agnese Augello, Giovanni Pilato and Salvatore Gaglio
“Pink Seal” a Certification for Firms’ Gender Equity 169
Tindara Addabbo, Gisella Facchinetti, Giovanni Mastroleo and Tiziana Lang
Intensive Computational Forecasting Approach to the Functional Demographic
Lee Carter Model 177
Valeria D’Amato, Gabriella Piscopo and Maria Russolillo
Conflicts in the Middle-East. Who Are the Actors? What Are Their Relations?
A Fuzzy LOGICal Analysis for IL-LOGICal Conflicts 187
Gianni Ricci, Gisella Facchinetti, Giovanni Mastroleo, Francesco Franci
and Vittorio Pagliaro
Chapter 4. Biological Aspects
Comparing Early and Late Data Fusion Methods for Gene Function Prediction 197
Matteo Re and Giorgio Valentini
An Experimental Comparison of Random Projection Ensembles with Linear
Kernel SVMs and Bagging and BagBoosting Methods for the Classification
of Gene Expression Data 208
Raffaella Folgieri
Changes in Quadratic Phase Coupling of EEG Signals During Wake and Sleep
in Two Chronic Insomnia Patients, Before and After Cognitive
Behavioral Therapy 217
Stephen Perrig, Pierre Dutoit, Katerina Espa-Cervena,
Vladislav Shaposhnyk, Laurent Pelletier, François Berger
and Alessandro E.P. Villa
viii
SVM Classification of EEG Signals for Brain Computer Interface 229
G. Costantini, M. Todisco, D. Casali, M. Carota, G. Saggio, L. Bianchi,
M. Abbafati and L. Quitadamo
Role of Topology in Complex Neural Networks 234
Luigi Fortuna, Mattia Frasca, Antonio Gallo, Alessandro Spata
and Giuseppe Nunnari
Non-Iterative Imaging Method for Electrical Resistance Tomography 241
Flavio Calvano, Guglielmo Rubinacci and Antonello Tamburrino
Role of Temporally Asymmetric Synaptic Plasticity to Memorize Group-Synchronous
Patterns of Neural Activity 247
Silvia Scarpetta, Ferdinando Giacco and Maria Marinaro
Algorithms and Topographic Mapping for Epileptic Seizures Recognition
and Prediction 261
N. Mammone, F. La Foresta, G. Inuso, F.C. Morabito, U. Aguglia
and V. Cianci
Computational Intelligence Methods for Discovering Diagnostic Gene Targets
About aGVHD 271
Maurizio Fiasché, Maria Cuzzola, Roberta Fedele, Domenica Princi,
Matteo Cacciola, Giuseppe Megali, Pasquale Iacopino and
Francesco C. Morabito
Dynamic Modeling of Heart Dipole Vector for the ECG and VCG Generation 281
Fabio La Foresta, Nadia Mammone, Giuseppina Inuso
and Francesco Carlo Morabito
Chapter 5. Applications
The TRIPLE Hybrid Cognitive Architecture: Connectionist Aspects 293
Maurice Grinberg and Vladimir Haltakov
Interactive Reader Device for Visually Impaired People 306
Paolo Motto Ros, Eros Pasero, Paolo Del Giudice, Vittorio Dante
and Erminio Petetti
On the Relevance of Image Acquisition Resolution for Hand Geometry
Identification Based on MLP 314
Miguel A. Ferrera, Joan Fàbregas, Marcos Faundez-Zanuy,
Jesús B. Alonso, Carlos Travieso and Amparo Sacristan
Evaluating Soft Computing Techniques for Path Loss Estimation
in Urban Environments 323
Filippo Laganà, Matteo Cacciola, Salvatore Calcagno, Domenico De Carlo,
Giuseppe Megali, Mario Versaci and Francesco Carlo Morabito
ix
The Department Store Metaphor: Organizing, Presenting and Accessing Cultural
Heritage Components in a Complex Framework 332
Umberto Maniscalco, Gianfranco Mascari and Giovanni Pilato
Subject Index 339
Author Index 341
x
Subject Index
ACE 163
algorithmic inference 19
ANN 271
artificial intelligence 29
artificial neural network(s) 306, 323
associative memory 247
attractors 11
audio and video recordings 51
background modeling 101
background subtraction 101
BagBoosting 208
Bagging 208
Bayesian decision networks 163
Bayesian networks 29
bicoherence 217
biometric recognition 62
biometrics 314
bispectrum 217
blind source separation 81
brain mapping 261
chatbot 163
Choquet integral 136
classification 91, 229
cognitive behavioral therapy 217
cognitive modeling 293
complex network(s) 39, 234
conflict 187
consensus management 136
constant Q transform 91
conversation modeling 70
convex weighting 154
cortical dynamics 247
cortico-cortical resonances 217
data fusion 29
data integration 197
decision fusion 197
decision making 154
decision templates 197
distance 154
DSS 163
dynamic portfolio management 146
dynamic similarity assessment 293
early fusion 197
ECG 281
electrical resistance tomography 241
electroencephalography
(EEG) 229, 261
emergent computation 11
ensemble 208
entropy 261
epilepsy 261
equal opportunities 169
evaluating forecasts 323
face recognition 62
feature selection 271
forecasting 177
foreground modeling 101
frequency domain algorithms 81
functional demographic model 177
fuzzy expert system 169
fuzzy logic 169, 187
fuzzy number 154
gender equity 169
gene function prediction 197
gene targets 271
GEP 271
group decision theory 136
GVHD 271
hand-geometry 314
haptic interfaces 306
heart dipole model 281
heart diseases 281
image processing 306
insomnia 217
inverse problems 241
keyword spotting 70
late fusion 197
Lee Carter model 177
linear predictive coding 116
MATLAB 29
metabolic networks 39
Middle East 187
minimum guarantee 146
mobility model 19
moving object detection 101
multi criteria analysis 136
Neural Nets WIRN09B. Apolloni et al. (Eds.)IOS Press, 2009© 2009 The authors and IOS Press. All rights reserved.
339
multilayer perceptron 116
music transcription 91
Naive Bayes combiner 197
neural network(s) 39, 101, 116, 314
neuron 234
noise 11, 234
non additive measures 136
non iterative imaging methods 241
non-destructive testing 241
onset detection 91
optical character recognition 306
pareto-like distribution law 19
partitioned block algorithms 81
path loss prediction 323
perceptual assessment 51
privacy 62
processes with memory 19
propagation of belief 29
random Boolean networks 11
random projection 208
random subspace 208
randomized maps 208
real-time systems 306
resolution 314
retrieval and mapping 293
reverberant environment 81
robustness and fault tolerance
comparison 39
scenario 146
security 62
seismic signals discrimination 116
self organization 101
smoothing 177
SNR 271
social communities 19
spatio-temporal patterns 247
stopped object 101
support vector machine(s)
(SVM) 91, 116, 229, 323
synchrony 247
tabletop 70
topology 234
uncertainty 154
urban environment 323
VCG 281
vector space integration 197
vocal and facial expression
of emotion 51
weighted averaging 197
340
Author Index
Abbafati, M. 229
Addabbo, T. 169
Aguglia, U. 261
Alonso, J.B. 314
Apolloni, B. 19
Augello, A. 163
Barbieri, A. 11
Barro, D. 146
Bassis, S. 19
Berger, F. 217
Bianchi, L. 229
Cacciola, M. 271, 323
Calcagno, S. 323
Calvano, F. 241
Canestrelli, E. 146
Carota, M. 229
Casali, D. 229
Cianci, V. 261
Cifani, S. 70
Colacci, A. 11
Colombini, G. 110
Conti, V. 39
Corazza, M. 136
Costantini, G. 91, 229
Cuzzola, M. 271
D’Amato, V. 177
Dante, V. 306
De Carlo, D. 323
Del Giudice, P. 306
Di Maio, G. 51
Dutoit, P. 217
Espa-Cervena, K. 217
Esposito, A. 51
Esposito, A.M. 116
Fàbregas, J. 314
Facchinetti, G. 169, 187
Faundez-Zanuy, M. 62, 314
Fedele, R. 271
Ferrera, M.A. 314
Fiasché, M. 271
Filippone, M. 3
Folgieri, R. 208
Fortuna, L. 234
Franci, F. 187
Frasca, M. 234
Gaglio, S. 163
Gallo, A. 234
Giacco, F. 116, 247
Gil-Aluja, J. 127
Gil-Lafuente, J. 154
Giove, S. 136
Giudicepietro, F. 116
Grassi, M. 62
Grinberg, M. 293
Haltakov, V. 293
Iacopino, P. 271
Inuso, G. 261, 281
Kauffman, S.A. 11
La Foresta, F. 261, 281
Laganà, F. 323
Lang, T. 169
Lanza, B. 39
Luccarini, L. 110
Maddalena, L. 101
Mammone, N. 261, 281
Maniscalco, U. 332
Marinaro, M. 116, 247
Mascari, G. 332
Mastroleo, G. 169, 187
Masulli, F. 3
Megali, G. 271, 323
Mello, P. 110
Morabito, F.C. 261, 271, 281, 323
Motto Ros, P. 306
Nunnari, G. 234
Pagliaro, V. 187
Palmieri, F. 29
Parisi, R. 81
Pasero, E. 306
Pelletier, L. 217
Perfetti, R. 91
Perrig, S. 217
Petetti, E. 306
Petrosino, A. 101
Piazza, F. 70
Picaro, A. 81
Neural Nets WIRN09B. Apolloni et al. (Eds.)IOS Press, 2009© 2009 The authors and IOS Press. All rights reserved.
341
Pilato, G. 163, 332
Piscopo, G. 177
Princi, D. 271
Principi, E. 70
Quitadamo, L. 229
Re, M. 197
Ricci, G. 187
Riviello, M.T. 51
Rocchi, C. 70
Rotili, R. 70
Rovetta, S. 3
Rubinacci, G. 241
Russolillo, M. 177
Sacristan, A. 314
Saggio, G. 229
Scarpetta, S. 116, 247
Scarpiniti, M. 81
Serra, R. 11
Shaposhnyk, V. 217
Sorbello, F. 39
Sottara, D. 110
Spata, A. 234
Squartini, S. 70
Tamburrino, A. 241
Todisco, M. 91, 229
Travieso, C. 314
Uncini, A. 81
Valentini, G. 197
Valerio, L. 19
Versaci, M. 323
Villa, A.E.P. 217
Villani, M. 11
Vitabile, S. 39
342
Support Vector Machines and MLP forautomatic classification of seismic signals
at Stromboli volcano
Ferdinando GIACCOa,1, Antonietta Maria ESPOSITOb, Silvia SCARPETTAa,c,d,Flora GIUDICEPIETROb and Maria MARINAROa,c,d
a Department of Physics, University of Salerno, Italyb Istituto Nazionale di Geofisica e Vulcanologia (Osservatorio Vesuviano), Napoli, Italy
c INFN and INFM CNISM, Salerno, Italyd Institute for Advanced Scientific Studies, Vietri sul Mare, Italy, Germany
Abstract. We applied and compared two supervised pattern recognition techniques,namely the Multilayer Perceptron (MLP) and Support Vector Machine (SVM),to classify seismic signals recorded on Stromboli volcano. The available data arefirstly preprocessed in order to obtain a compact representation of the raw seismicsignals. We extract from data spectral and temporal information so that each inputvector is made up of 71 components, containing both spectral and temporal infor-mation extracted from the early signal. We implemented two classification strate-gies to discriminate three different seismic events: landslide, explosion-quake, andvolcanic microtremor signals. The first method is a two-layer MLP network, witha Cross-Entropy error function and logistic activation function for the output units.The second method is a Support Vector Machine, whose multi-class setting is ac-complished through a 1vsAll architecture with gaussian kernel. The experimentsshow that although the MLP produces very good results, the SVM accuracy is al-ways higher, both in term of best performance, 99.5%, and average performance,98.8%, obtained with different sampling permutations of training and test sets.
Keywords. Seismic signals discrimination, Linear Predictive Coding, NeuralNetworks, Support Vector Machine, Multilayer Perceptron.
Introduction
Automatic discrimination among seismic events is a critical issue for the continuousmonitoring of seismogenic zones and active volcanic areas. This is the case of the Strom-boli island (southern Italy) where the seismic activity is intense and the analysis of thedata should be very fast in order to communicate, as soon as possible, the significance ofthe recorded information to civil defense authorities.
The available data are provided through a broadband seismic network, installed dur-ing the crisis of December 2002, to monitor the evolution of the volcanic processes [1,2].Since its installation, the network has recorded many thousands of explosion-quake and
1Corresponding Author: Ferdinando Giacco, Department of Physics, University of Salerno, Via S. Allende,84081 Baronissi (SA), Italy; E-mail: [email protected].
landslide signals. The detection of landslide seismic signals and their discrimination fromthe other transient signals was one of the most useful tools for monitoring the stabilityand the activity of the northwest flank of the volcano.
In recent years, several methods have been proposed for detecting and discriminat-ing among different seismic signals, based on spectral analysis [3,11] , cross-correlationtechnique [5,6] and neural networks [7,8,9,4,10]. In this paper we report on two differ-ent supervised approaches for discrimination among explosion-quake, landslide and mi-crotremor signals, which characterize the strombolian activity. The first method is basedon one of the widespread used neural network, the Multilayer Perceptron (MLP), whilethe second is the Support Vector Machine algorithm (SVM) [13].
The Support Vector Machines, originally developed for the discrimination of two-classes problems, have then been extended to multi-class settings [15], and nowadaysmulti-class SVMs architectures like 1vs1 and 1vsAll are widely used in different fields[16,17,18,20], including recent application on seismic signals recognition [19].
The reminder of the paper is organized as follows: Section I describes data and thepreprocessing techniques to represent data in a meaningful and compressed form; SectionII describes the classification techniques, namely Multilayer perceptron (A) and SupportVector Machine (B); lastly, in Section III, the conclusion on the experimental results arereported.
1. Seismic Data and preprocessing
Stromboli is a volcanic island in the Mediterranean Sea located north of eastern Sicily.Stromboli exhibits continuous eruptive activity generally involving the vents at the top ofthe cone. This activity consists of individual explosions emitting gasses and pyroclasticfragments typically six to seven times per hour. Seismic signals recorded on Stromboliare characterized by microtremor and explosion-quakes, usually associated with Strom-bolian explosions. This typical Strombolian activity sometimes stops during sporadic ef-fusive episodes characterized by lava flows. The most recent effusive phases occurred in1930, 1974, 1985, in December 2002 and the last one in February 2007. The December2002 effusive phase began with a large landslide on the “Sciara del Fuoco”, a depres-sion on the northwest flank of the volcano that generated a tsunami with maximum waveheight of about 10 m.
After this episode the northwest flank became unstable and as many as 50 landslidesignals per day were recorded by the seismic monitoring network operated by the IstitutoNazionale di Geofisica e Vulcanologia (INGV) [12].
The broadband network operated by the INGV for the seismic monitoring ofStromboli volcano has operated since January 2003. It consists of 13 digital stationsequipped with three-component broadband Guralp CMG-40 seismometers, with fre-quency response of 60 sec (see Fig. 1). The data are acquired by digital recorders,with a sampling rate of 50 samples/sec, and are continuously transmitted via Internetto the recording center in Naples at the Vesuvius Observatory (INGV). A more detaileddescription of the seismic network and data-acquisition system can also be found atwww.ov.ingv.it/stromboli.html [1]. Since its installation, the network has recorded astransient signals some hundreds of thousands of explosion-quakes and thousands of land-slides, in addition to continuous volcanic microtremor signals. The explosion-quakes are
Figure 1. Map showing the current network geometry of 13 digital broadband stations deployed on StromboliIsland.
characterized by a signal exhibiting no distinct seismic phases and having a frequencyrange of 1U10 Hz. Landslide signals are higher in frequency than the explosion-quakesand their typical waveform has an emergent onset. The microtremor is a continuous sig-nal having frequencies between 1 and 3 Hz. The network has also recorded local, re-gional and teleseismic events. The data set includes1159 records from the three com-ponents of five seismic stations: STR1, STRA, STR8, STR5, STRB (see Fig. 1). It ismade up of 430 explosion-quakes, 267 landslides, and 462 microtremor signals. For eachevent, a record of 23 sec is taken, at 50-Hz sampling frequency. The arrival-time pickingof the explosion-quake and landslide signals has been performed by the analysts, usingdata windows having about 3 sec of pre-event signal. Hence, we used 5/8 of the availabledata as the training set (724 samples) and the remaining 3/8 provides the testing set (435samples).
The preprocessing stage is performed using the Linear Predictive Coding (LPC)technique [21], a technique frequently used in the speech recognition field to extractcompact spectral information. LPC tries to predict a signal sample by means of a linearcombination of various previous signal samples, that is:
s∗(n) ' c1s(n− 1) + c2s(n− 2) + . . . + cps(n− p) (1)
wheres(n) is the signal sample at timen ands∗(n) is its prediction,p is the model order(the number of the prediction coefficients). The estimate of the prediction coefficientsci , for i = 1, . . . p, is obtained by an optimization procedure that tries to reduce theerror between the real signal at time n and its LPC estimate. The number of predictioncoefficientsp is problem dependent. This number must be determined via a trade-off be-tween preserving the information content and optimizing compactness of the representa-tion. Here, we choose a 256-point window usingp = 6 LPC coefficients for each signal.Increasingp does not improve the information content significantly, but decreases thecompactness of the representation markedly. Therefore we extract six coefficients from
each of the eight Hanning windows (5 secs) in which we divided the file signal, eachwindow overlapping with the previous one by 2.5 sec. Because LPC provides frequencyinformation [21], we have also added time-domain information. We use the functionfm
computed as the difference, properly normalized, between the maximum and minimumsignal amplitudes within a 1-sec windowWm:
fm =(max[si]−min[si])×N∑N
n=1(max[si]−min[si]), i ∈ Wm, m = 1, . . . , N. (2)
Thus, for aN = 23 sec signal, we obtain a 23-element time-features vector. There-fore, each signal (composed of 1159 samples), is encoded with a 71-feature vector(6 × 8 = 48 frequency features+23 time features). The use of both spectral and tem-poral features more closely approximates the waveform characteristics considered byseismologists when visually classifying signals.
In the following, we design two supervised techniques to build an automatic clas-sifier and we train them to distinguish between landslides, explosion-quakes, and mi-crotremor.
2. Classification techniques
2.1. Multi-layer Perceptron
The multilayer perceptron (MLP) using back-propagation learning algorithm ([19]) isone of the most widely used neural network. There are two kinds of information process-ing performed in multilayer perceptron. The first one is the forward propagation of theinput by the environment through the network from the input units to the output units.The other one is the learning algorithm, which consists of the backpropagation of the er-rors by the environment through the network from the output units to the input units, andweight and bias updates. The purpose of back propagation is to adjust the internal state(weights and biases) of the multilayer perceptron so that to produce the desired outputfor the specified input.
In our experiments we built a two-layer MLP network for the three-class discrim-ination problem [25]. Weight optimization is carried out during the training proce-dure through minimisation of the Cross-Entropy Error Function [22] using the Quasi-Newton algorithm [22]. The network output activation function is the logistic while thehyperbolic-tangent is used for the hidden units. Moreover, when logistic output units andcross-entropy error function are used together, the network output represents the prob-ability of an input vector to belong to one of the investigated classes. The number ofhidden units and training cycles has been chosen empirically by trial and error.
Lastly, to verify the generalization ability of the network, after the training stepwe test the MLP on a subset (the testing set) not used to train the network. To assessthe system robustness we test the network several times, changing randomly the weightinitialization and the permutation of data. In this way the network performance is theaverage of the percentages of correct classification obtained with each test. The TableI shows the error matrix corresponding to the best classification performance, obtainedwith an MLP architecture made up of 5 hidden units and 110 training cycles. The best
Table 1. Error matrix corresponding to the best MLP performance, obtained with a network architecture madeup of 5 hidden units and 110 training cycles. The overall accuracy is 98.39 %.
Classes Landslide Explosion-quake Microtremor
Landslide 97 0 4
Explosion-quake 0 167 1
Microtremor 2 0 164
Figure 2. SVM optimal solution in a two dimensional space for a non-linearly separable classification prob-lem. The distance between the optimal hyperplane and the nearest datum is called margin, while the data cor-responding to the filled circles and the filled rectangle aresupport vectors. The slack variablesξi andξj arehere introduced (see Eqn. 3 ) to allow the violation of constraints for some training samples.
overall accuracy is 98.39% while the average value taken on 10 different permutationsof training and test set is 97.2 %.
2.2. Support Vector Machines
Support Vector Machines (SVMs) have become a popular method in pattern classifica-tion for their ability to cope with small training set and high-dimensional data [13,14,15].
The SVM algorithm goal is to find the separating decision function with the max-imum separating margin, in order to maximize the generalization ability when a newsample is presented. This can be formulated through a lagrangian minimization problemwith inequality constraints on data separation.
If the training data are linearly separable all the samples lie above the maximummargin, while the data lying on the margins are calledsupport vectors. In our study weused an SVM formulation assuming that data are not linearly separable (see Fig. 2). Inthis case, we allow the violation of some constraints by introducing the non-negativeslack variables,ξi ≥ 0, into the lagrangian problem. Namely, the lagrangianQ (for Mtraining samples) is given by
Q(w, b, ξ) =12‖w‖2 + C
M∑
i=1
ξi (3)
wherew is an m-dimensional vector which locates the optimal hyperplane,b is a biasterm andC is a parameter determining the weight of the slack variablesξi. The inequalityconstraints are then given by
yi(wT xi + b) ≥ 1− ξi for i = 1, . . . , M, (4)
wherexi are the training samples andyi the associated labels (i.e.±1 in a binary setting).The solution of the SVM problem is then achieved by introducing lagrangian multipliers,α1, . . . , αM , and looking for the related “dual problem”, given by
Q(α) =M∑
i=1
αi − 12
M∑
i,j=1
αiαjyiyjxTi xj (5)
subject to the constraints
M∑
i=1
yiαi = 0, C ≥ αi ≥ 0 for i = 1, . . . , M. (6)
One of the advantage of the SVM algorithm is that the solution is unique and it onlydepends on the support vectors.
Hence, to enhance linear separability, the original input space is mapped into a high-dimensional dot-product space called the feature space. The advantage of using kernelsis that we need not treat the high-dimensional feature space explicitly, namely, in solvingEqn. 5 we useK(xi, xj) instead ofxT
i xj . The most used “kernel functions” in literatureare polynomials (of different degrees) and gaussians. In our experiments we tried bothkernel choices, finding that the best performance is achieved using a gaussian kernel,namely
K(xi, xj) = exp(−γ‖xi − xj‖2), (7)
whereγ is an additional parameter manually determined.Concerning the possibilities to extend the originally binary SVMs to multi-class
settings, there has been quite some research recently [23,24]. Two main architectureswere originally proposed for anl-classes problem [15]:
• One versus All (1vsAll):l binary classifiers are applied on each class versus theothers. Each sample is assigned to the class with the maximum output.
• One versus One (1vs1):l(l − 1)/2 binary classifiers are applied, one for eachpair of classes. Each sample is assigned to the class getting the highest numberof votes. A vote for a given class is defined as a classifier assigning the pattern tothat class.
In the current case, we focus on the 1vsAll approach, building 3 different SVMs, each ofwhich is able to separate one specific class from all the others. We tried several values forthe parametersC and used the gaussian kernel reported in Eqn. 7. The best classificationhas an overall accuracy of 99.54% (see Table II), while the average value computed on10 different permutations of training and test is 98.76 % .
Table 2. Error matrix corresponding to the best 1vsAll SVM performance with gaussian kernel. The overallaccuracy is 99.54%
Classes Landslide Explosion-quake Microtremor
Landslide 91 0 1
Explosion-quake 0 169 0
Microtremor 1 0 173
3. Conclusions
Two supervised strategies have been implemented to discriminate among three differentseismic events: landslides, explosion-quakes and microtremor. Looking at the results wecan state that the discrimination performance is very good for both MLP and SVM al-gorithms. The MLP best performance has a percentage of correct classification of 98.4,while the average value obtained on several training and test sampling is 97.2%. How-ever, the 1vsAll SVM with gaussian kernel always achieves higher accuracy both in termof best, 99.5%, and average, 98.8%, performance.
We also remark that the extracted features used as parametric and compressed repre-sentation of the seismic signals give robust information on their nature. This can be alsoargued by looking at the SVMs results, where the solution only depends on the supportvectors, acting as the relevant part of the training set. Indeed, different permutations ofthe training and test sets provide different results, meaning that many support vectorsare present within the data, that is the data representation is well suited for the requiredclassification task.
References
[1] W. De Cesare, M. Orazi, R. Peluso, G. Scarpato, A. Caputo, L. D’Auria, F. Giudicepietro, M. Martini, C.Buonocunto, M. Capello, A. M. Esposito (2009) - The broadband seismic network of Stromboli volcano(Italy), Seismological Research Letters. In press.
[2] M. Martini, F. Giudicepietro, L. D’auria, A. M. Esposito, T. Caputo, R. Curciotti, W. De Cesare, M.Orazi, G. Scarpato, A. Caputo, R. Peluso, P. Ricciolino, A. Linde, S. Sacks (2008) - Seismologicalmonitoring of the February 2007 effusive eruption of the Stromboli volcano, Annals of Geophysics, Vol.50, N. 6, December 2007, pp. 775-788.
[3] Hartse, H. E., W. S. Phillips, M. C. Fehler, and L. S. House (1995). Singlestation spectral discriminationusing coda waves, Bull. Seism. Soc. Am. 85, 1464U1474.
[4] Del Pezzo, E., A. Esposito, F. Giudicepietro, M. Marinaro, M. Martini, and S. Scarpetta (2003). Dis-crimination of earthquakes and underwater explosions using neural networks, Bull. Seism. Soc. Am. 93,no. 1, 215U223.
[5] Joswig, M. (1990). Pattern recognition for earthquake detection, Bull. Seism. Soc. Am. 80, 170U186.
Table 3. Long Table caption. Dic, quaeso, mihi: quae est ista, quae consurgens ut aurora rutilat ut beneficam.Nationi, et iudicia sua non manifestavit eis
Quia fecit Quia respexit
Fecit Nationi, et iudicia sua non manifestavit eis. Et exultavit spiritus meus in salutari meo
Respexit Et misericordia eius a progenie
[6] Rowe, C. A., C. H. Thurber, and R. A. White (2004). Dome growth behavior at Soufriere Hills volcano,Montserrat, revealed by relocation of volcanic event swarms, 1995U1996, J. Volc. Geotherm. Res. 134,199U221.
[7] Dowla, F. U. (1995). Neural networks in seismic discrimination, in Monitoring a Comprehensive TestBan Treaty, E. S. Husebye and A. M. Dainty (Editors), NATO ASI, Series E, Vol. 303, Kluwer, Dor-drecht, The Netherlands, 777U789.
[8] Wang, J., and T. Teng (1995). Artificial neural network based seismic detector, Bull. Seism. Soc. Am.85, 308U319.
[9] Tiira, T. (1999). Detecting teleseismic events using artificial neural networks, Comp. Geosci. 25,929U939.
[10] Esposito, M., F. Giudicepietro, L. D’Auria, S. Scarpetta, M. G. Martini, M. Coltelli, and M. Marinaro(2008). Unsupervised Neural Analysis of Very-Long-Period Events at Stromboli Volcano Using theSelf-Organizing Maps, Bull. Seism. Soc. Am., Vol. 98, No. 5, pp. 2449U2459.
[11] Gitterman, Y., V. Pinky, and A. Shapira (1999). Spectral discrimination analysis of Eurasian nuclear testsand earthquakes recorded by the Israel seismic network and the NORESS array, Phys. Earth. Planet.Interiors 113, 111U129.
[12] Martini, M., B. Chouet, L. DŠAuria, F. Giudicepietro, and P. Dawson (2004). The seismic source stabil-ity of the Very Long Period signals of the Stromboli volcano, in I General Assembly AbstractsUEGU,Nice, 25U30 April 2004.
[13] Vapnik, V. N. (1995). The Nature of Statistical Learning Theory, Springer.[14] Webb, A. R. (2002). Statistical Pattern Recognition, John Wiley and Sons.[15] Schffolkopf, B. and A.J. Smola (2002). Learning with Kernels: Support Vector Machines, Regulariza-
tion, Optimization and Beyond, MIT Press.[16] Melgani, F. and L. Bruzzone (2004). Classification of hyperspectral remote sensing images with support
vector machines, IEEE Trans. on Geoscience and Remote Sensing, vol. 42, pp. 1778-1790.[17] Foody, G. F. and Ajay Mathur (2004). A relative evaluation of multiclass image classification by support
vector machines, IEEE Trans. on Geoscience and Remote Sensing, vol. 42, pp. 1335-1343.[18] Hsu, C. W. and C. J. Lin (2002). A comparison of methods for multiclass support vector machines, IEEE
Trans. on Neural Networks, vol. 13, pp. 415-425.[19] Masotti, M., S. Falsaperla, H. Langer, S. Spampinato, and R. Campanini (2006), Application of Support
Vector Machine to the classification of volcanic tremor at Etna, Italy, Geophys. Res. Lett., 33, L20304,doi:10.1029/2006GL027441.
[20] Kahsay, L., F. Schwenker and G. Palm (2005). Comparison of multiclass SVM decomposition schemesfor visual object recognition, LNCS, Springer, vol. 3663, pp. 334-341.
[21] Makhoul, J. (1975). Linear prediction: a tutorial review, Proc. IEEE 63, 561-580.[22] Bishop, C. (1995). Neural Networks for Pattern Recognition, Oxford University Press, New York, 500
pp.[23] F.Giacco, S. Scarpetta, L. Pugliese, M. Marinaro and C. Thiel. Application of Self organizing Maps to
multi-resolution and multi-spectral remote sensed images, “New directions in neural networks”, Pro-ceedings of 18th Italian Workshop on neural networks (WIRN 2008), IOS Press (Netherlands), pp. 245-253.
[24] C. Thiel, F. Giacco, F. Shwenker G. Palm. Comparison of neural Classification Algorithms applied toland cover mapping, “New directions in neural networks”, Proceedings of 18th Italian Workshop onneural networks (WIRN 2008), IOS Press (Netherlands), pp. 254-263.
[25] A. M. Esposito, F. Giudicepietro, S. Scarpetta, L. D’Auria, M. Marinaro, M. Martini (2006) - Automaticdiscrimination among landslide, explosion-quake and microtremor seismic signals at Stromboli volcanousing Neural Networks, Bull. Seismol. Soc. Am. (BSSA) Vol. 96, No. 4A, pp. 1230-1240, August 2006,doi: 10.1785/0120050097