modeling bottlenose dolphin habitat distri-9 by means of ......chiara marini the histograms (inside...

1
Modeling bottlenose dolphin habitat by means of static environmental variables Chiara Marini 1 , Fulvio Fossa 2 , Michela Bellingeri 2 , Guido Gnone 2 , Paolo Vassallo 1 1 DISTAV, Department of Land, Environment and Health Science, University of Genoa, Corso Europa 26, Genoa, Italy 2 Acquario di Genova, Area Porto Antico-Ponte Spinola, 16128, Genoa, Italy E-mail: [email protected] INTRODUCTION The application of models to study the relationship between species distribution and habitat use in cetaceans has grown considerably in recent decades and today is considered an interesting tool to support management policies. However, the researchers' attention has been focused so far only some species while attempts to develop models of habitat for bottlenose dolphins are relatively few. The most frequently applied techniques are regression models, such as GLM (Generalized Line Model) (Bailey and Thompson, 2006); and GAM (Generalized Additive Model) (Blasi and Boitani, 2012). The objective of this work is to provide a comparison of these two techniques with a type of analysis based on ensemble learning method, never applied in cetology: the Random Forest (RF) (Cutler et al., 2007). MATERIAL & METHODS The study area extends from Punta Chiappa (Genoa) to Punta Bianca (La Spezia) (FIG.1). Four static variables were identified: depth, bottom slope, distance from 100 meters bathymetric contour and distance from the coast. GLM and GAM are the most used techniques to model while RF is a learning technique based on ensemble learning method; the method requires a training phase to perform a decision-making process. A random number of ramifications is generated and a 'forest ' of possible sequences of choices outcomes. At the end of the process, the system detects the combination of variables with the highest percentage of success. RESULTS DISCUSSION In this study we identified the variables that influence the presence of bottlenose dolphin along the eastern Ligurian coast through the construction of habitat models. The results showed that the RF is the best technique that fitted the data set. This success comes from the ability of RF to analyze more accurately the dataset through the process of training earlier described. For bottlenose dolphin distribution in the Eastern coast of Liguria (NW Mediterranean Sea) RF allowed the identification of hot spots of presence along the coastal zone. Here the combination of the selected physiographic variables is particularly close to the signature that is expected to be strongly enticed in these sub- zones in the study area. Results obtained by RF application are particularly encouraging and are worthy of further applications in other areas of the Mediterranean to verify and validate results. REFERENCES Bailey H., Thompson P. (2006). Quantitative analysis of bottlenose dolphin movement patterns and their relationship with foraging. Anim. Ecol. 75:456465 Blasi M., Boitani L. (2012). Modelling fine-scale distribution of bottlenose dolphin Tursiops truncatus using physiographic features on Filicudi (southern Tyrrhenian Sea, Italy). Endangered Species Res. 17:269-288. Cutler DR., Edwards TC., Beard KH., Cutler A., Hess KT., Gibson J., Lawler JJ.(2007). “Random forest for classification in ecology. Ecology, Vol. 88, No. 11, pp 2783- 2792. Chiara Marini The histograms (inside of the Fig. 2-3-4) describe the number of sightings in function of predicted probabilities. Red vertical line identifies the average predicted probability in presence cells. RF resulted both the most accurate and the most precise with the highest average probability value (90.4% of mean predicted probabilities in presence cells) and with 66.7% of presence cells with a predicted probability between 90% and 100%. The univariate partial dependence plots (Fig.5) describe the three most important variables with an evident threshold effect for depth and distance from 100m. Figure 3: T. truncatus predictive map based on GAM. Cells with sightings are shown with black dots; black lines identify the main isobaths. A continuous pattern of high sighting probabilities parallel to coastline Figure 2: T. truncatus predictive map based on GLM. Cells with sightings are shown with black dots; black lines identify the main isobaths. Probabilities turned to low values moving toward open sea even if several sightings were recorded far from coast Figure 4: T. truncatus predictive map based on RF. Cells with sightings are shown with black dots; black lines identify the main isobaths. A few spots of parcels where the probability of sighting was very high DISTRI-9 number of surveys effort number of sightings NM h 2005 46 931 140 16 2006 50 1843 215 26 2007 50 1506 225 17 2008 49 1530 225 16 2009 50 1383 200 18 2010 49 1490 175 17 2011 60 1806 225 34 2012 48 1296 150 27 Total 402 11786 1556 171 Figure 1: study area (the blue line represented the effort from 2005 to 2012) Figure 5: Univariate partial dependence plots of RF most important variables in the study area

Upload: others

Post on 27-Sep-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modeling bottlenose dolphin habitat DISTRI-9 by means of ......Chiara Marini The histograms (inside of the Fig. 2-3-4) describe the number of sightings in function of predicted probabilities

Modeling bottlenose dolphin habitat by means of static environmental variables

Chiara Marini1, Fulvio Fossa2, Michela Bellingeri2, Guido Gnone2, Paolo Vassallo1 1DISTAV, Department of Land, Environment and Health Science, University of Genoa, Corso Europa 26, Genoa, Italy

2Acquario di Genova, Area Porto Antico-Ponte Spinola, 16128, Genoa, Italy E-mail: [email protected]

INTRODUCTION

The application of models to study the

relationship between species distribution

and habitat use in cetaceans has grown

considerably in recent decades and

today is considered an interesting tool to

support management policies. However,

the researchers' attention has been

focused so far only some species while

attempts to develop models of habitat for

bottlenose dolphins are relatively few. The

most frequently applied techniques are

regression models, such as GLM

(Generalized Line Model) (Bailey and

Thompson, 2006); and GAM (Generalized

Additive Model) (Blasi and Boitani, 2012).

The objective of this work is to provide a

comparison of these two techniques with

a type of analysis based on ensemble

learning method, never applied in

cetology: the Random Forest (RF) (Cutler

et al., 2007).

MATERIAL & METHODS

The study area extends from Punta Chiappa (Genoa) to Punta Bianca (La

Spezia) (FIG.1). Four static variables were identified: depth, bottom slope,

distance from 100 meters bathymetric contour and distance from the

coast. GLM and GAM are the most used techniques to model while RF is a

learning technique based on ensemble learning method; the method

requires a training phase to perform a decision-making process. A random

number of ramifications is generated and a 'forest ' of possible sequences

of choices outcomes. At the end of the process, the system detects the

combination of variables with the highest percentage of success.

RESULTS

DISCUSSION

In this study we identified the variables

that influence the presence of

bottlenose dolphin along the eastern

Ligurian coast through the construction

of habitat models. The results showed

that the RF is the best technique that

fitted the data set.

This success comes from the ability of RF

to analyze more accurately the

dataset through the process of training

earlier described. For bottlenose

dolphin distribution in the Eastern coast

of Liguria (NW Mediterranean Sea) RF

allowed the identification of hot spots

of presence along the coastal zone.

Here the combination of the selected

physiographic variables is particularly

close to the signature that is expected

to be strongly enticed in these sub-

zones in the study area.

Results obtained by RF application are

particularly encouraging and are

worthy of further applications in other

areas of the Mediterranean to verify and validate results.

REFERENCES Bailey H., Thompson P. (2006). Quantitative analysis of bottlenose dolphin movement patterns and their relationship with foraging. Anim. Ecol. 75:456–465

Blasi M., Boitani L. (2012). Modelling fine-scale distribution of bottlenose dolphin Tursiops truncatus using physiographic features on Filicudi (southern Tyrrhenian

Sea, Italy). Endangered Species Res. 17:269-288.

Cutler DR., Edwards TC., Beard KH., Cutler A., Hess KT., Gibson J., Lawler JJ.(2007). “Random forest for classification in ecology. Ecology, Vol. 88, No. 11, pp 2783-

2792.

Chiara Marini

The histograms (inside of the

Fig. 2-3-4) describe the

number of sightings in function

of predicted probabilities. Red

vertical line identifies the

average predicted probability

in presence cells. RF resulted

both the most accurate and

the most precise with the highest average probability

value (90.4% of mean

predicted probabilities in

presence cells) and with 66.7%

of presence cells with a

predicted probability

between 90% and 100%. The

univariate partial

dependence plots (Fig.5)

describe the three most

important variables with an

evident threshold effect for depth and distance from

100m.

Figure 3: T. truncatus predictive map based on GAM. Cells with sightings are shown

with black dots; black lines identify the main isobaths.

A continuous pattern of

high sighting probabilities

parallel to coastline

Figure 2: T. truncatus predictive map based on GLM.

Cells with sightings are shown with black dots; black lines identify the main isobaths.

Probabilities turned to low

values moving toward

open sea even if several

sightings were recorded far

from coast

Figure 4: T. truncatus predictive map based on RF. Cells with sightings are shown

with black dots; black lines identify the main isobaths.

A few spots of parcels where the probability of sighting was very high

DISTRI-9

number of

surveys

effort number of

sightings NM h

2005 46 931 140 16

2006 50 1843 215 26

2007 50 1506 225 17

2008 49 1530 225 16

2009 50 1383 200 18

2010 49 1490 175 17

2011 60 1806 225 34

2012 48 1296 150 27

Total 402 11786 1556 171

Figure 1: study area (the blue line represented the effort from 2005 to 2012)

Figure 5: Univariate partial dependence plots of RF most important variables in

the study area