species distribution modelling for the people

12
BIODIVERSITY RESEARCH Species distribution modelling for the people: unclassified landsat TM imagery predicts bird occurrence at fine resolutions S. M. Shirley 1 *, Z. Yang 1 , R. A. Hutchinson 2 , J. D.Alexander 3 , K. McGarigal 4 and M. G. Betts 1 1 Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97331, USA, 2 School of EECS, Oregon State University, Corvallis, OR 97331, USA, 3 Klamath Bird Observatory, P.O. Box 758, Ashland, OR 97520, USA, 4 Department of Environmental Conservation, University of Massachusetts, 160 Holdsworth Way, Amherst, MA 01003- 9285, USA *Correspondence: S. M. Shirley, Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97331, USA. E-mail: [email protected] ABSTRACT Aim Assessing the influence of land cover in species distribution modelling is limited by the availability of fine-resolution land-cover data appropriate for most species responses. Remote-sensing technology offers great potential for predicting species distributions at large scales, but the cost and required exper- tise are prohibitive for many applications. We test the usefulness of freely avail- able raw remote-sensing reflectance data in predicting species distributions of 40 commonly occurring bird species in western Oregon. Location Central Coast Range, Cascade and Klamath Mountains Oregon, USA. Methods Information on bird observations was collected from 4598 fixed- radius point counts. Reflectance data were obtained using 30-m resolution Landsat imagery summarized at scales of 150, 500, 1000 and 2000 m. We used boosted regression tree (BRT) models to analyse relationships between distribu- tions of birds and reflectance values and evaluated prediction performance of the models using area under the receiver operating characteristic curve (AUC) values. Results Prediction success of models using all reflectance values was high (mean AUC = 0.79 0.10 SD). Further, model performance using individual reflectance bands exceeded those that used only Normalized Difference Vegeta- tion Index (NDVI). The relative influence of band 4 predictors was highest, indicating the importance of variables associated with vegetation biomass and photosynthetic activity. Across spatial scales, the average influence of predictors at the 2000 m scale was greatest. Main Conclusions We demonstrate that unclassified remote-sensing imagery can be used to produce species distribution models with high prediction success. Our study is the first to identify general patterns in the usefulness of spectral reflectances for species distribution modelling of multiple species. We conclude that raw Landsat Thematic Mapper data will be particularly useful in species distribution models when high-resolution predictions are required, including habitat change detection studies, identification of fine-scale biodiversity hotspots and reserve design. Keywords Boosted regression trees, land-cover, model validation, species distribution modelling, unclassified remote-sensing imagery, western Oregon. DOI: 10.1111/ddi.12093 ª 2013 John Wiley & Sons Ltd. http://wileyonlinelibrary.com/journal/ddi 1 Diversity and Distributions, (Diversity Distrib.) (2013) 1–12 A Journal of Conservation Biogeography Diversity and Distributions

Upload: others

Post on 01-Dec-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Species distribution modelling for the people

BIODIVERSITYRESEARCH

Species distribution modelling for thepeople: unclassified landsat TM imagerypredicts bird occurrence at fineresolutionsS. M. Shirley1*, Z. Yang1, R. A. Hutchinson2, J. D.Alexander3,

K. McGarigal4 and M. G. Betts1

1Department of Forest Ecosystems and

Society, Oregon State University, 321

Richardson Hall, Corvallis, OR 97331, USA,2School of EECS, Oregon State University,

Corvallis, OR 97331, USA, 3Klamath Bird

Observatory, P.O. Box 758, Ashland, OR

97520, USA, 4Department of Environmental

Conservation, University of Massachusetts,

160 Holdsworth Way, Amherst, MA 01003-

9285, USA

*Correspondence: S. M. Shirley, Department

of Forest Ecosystems and Society, Oregon

State University, 321 Richardson Hall,

Corvallis, OR 97331, USA.

E-mail: [email protected]

ABSTRACT

Aim Assessing the influence of land cover in species distribution modelling is

limited by the availability of fine-resolution land-cover data appropriate for

most species responses. Remote-sensing technology offers great potential for

predicting species distributions at large scales, but the cost and required exper-

tise are prohibitive for many applications. We test the usefulness of freely avail-

able raw remote-sensing reflectance data in predicting species distributions of

40 commonly occurring bird species in western Oregon.

Location Central Coast Range, Cascade and Klamath Mountains Oregon, USA.

Methods Information on bird observations was collected from 4598 fixed-

radius point counts. Reflectance data were obtained using 30-m resolution

Landsat imagery summarized at scales of 150, 500, 1000 and 2000 m. We used

boosted regression tree (BRT) models to analyse relationships between distribu-

tions of birds and reflectance values and evaluated prediction performance of

the models using area under the receiver operating characteristic curve (AUC)

values.

Results Prediction success of models using all reflectance values was high

(mean AUC = 0.79 � 0.10 SD). Further, model performance using individual

reflectance bands exceeded those that used only Normalized Difference Vegeta-

tion Index (NDVI). The relative influence of band 4 predictors was highest,

indicating the importance of variables associated with vegetation biomass and

photosynthetic activity. Across spatial scales, the average influence of predictors

at the 2000 m scale was greatest.

Main Conclusions We demonstrate that unclassified remote-sensing imagery

can be used to produce species distribution models with high prediction success.

Our study is the first to identify general patterns in the usefulness of spectral

reflectances for species distribution modelling of multiple species. We conclude

that raw Landsat Thematic Mapper data will be particularly useful in species

distribution models when high-resolution predictions are required, including

habitat change detection studies, identification of fine-scale biodiversity hotspots

and reserve design.

Keywords

Boosted regression trees, land-cover, model validation, species distribution

modelling, unclassified remote-sensing imagery, western Oregon.

DOI: 10.1111/ddi.12093ª 2013 John Wiley & Sons Ltd. http://wileyonlinelibrary.com/journal/ddi 1

Diversity and Distributions, (Diversity Distrib.) (2013) 1–12A

Jou

rnal

of

Cons

erva

tion

Bio

geog

raph

yD

iver

sity

and

Dis

trib

utio

ns

Page 2: Species distribution modelling for the people

INTRODUCTION

Land cover is considered to be one of the most important

drivers of biodiversity, affecting patterns of species diversity

(Jetz et al., 2007; Coops et al., 2009), species distributions

(Wilcove et al., 1998; Opdam & Wascher, 2004; Thuiller

et al., 2008) and ecological processes (Dickinson, 1991; Dale,

1997; Allan, 2004). However, we have a rather limited under-

standing of how recent land-cover changes have resulted in

changes to species distributions over larger scales (regional,

continental and global). Clearly, land cover exerts an influ-

ence on species distributions; the negative effects of habitat

loss on species extinctions are also well known (Balmford

et al., 2003). Land-cover changes over broad scales have the

potential to affect biodiversity through a number of mecha-

nisms such as habitat loss and fragmentation, enabling bio-

logical invasions and impairment of ecological processes

critical to ecosystem function. Whereas climate is hypo-

thesized to exert the major influence on distributions at

broad spatial scales, land cover may be the dominant influ-

ence over short terms and at finer spatial scales (Lemoine

et al., 2007; Soberon, 2007). However, this hypothesis has

been difficult to test because our ability to predict the effects

of land cover and land-cover change on species distributions

has been limited by the availability of estimates of land cover

at appropriate spatial and temporal scales. Habitat data at a

sufficiently fine resolution for most organisms under study

(Mladenoff et al., 1999; Bowman et al., 2001; Mitchell et al.,

2001; Betts et al., 2006) are rarely available at broad scales.

In recent decades, modelling the effects of land cover on

species distributions at larger spatial scales has been aided

greatly by the development of remote-sensing technology.

Satellite-borne optical remote sensors collect data from a

region at a variety of spatial resolutions. For example, the

Landsat 7 satellite captures six bands of the visible and infra-

red spectrums at resolutions of 30 m every 16 days (Kerr &

Ostrovsky, 2003; Turner et al., 2003). Satellite observations

of the earth’s surface reflectance at moderately high resolu-

tions (e.g. 30 9 30 m) and over large scales have provided

the basis for the development of data map layers that

describe discrete land-cover classes (e.g. Zinner et al., 2002;

Venier et al., 2004). In addition, indices derived from

remote-sensing data such as the Normalized Difference Vege-

tation Index (NDVI) have been used extensively in species

distribution modelling (e.g. Osborne et al., 2001; Roura-

Pascual et al., 2006; Bino et al., 2008). Techniques such as

LiDAR that provides very fine-scale habitat characterization

offer the potential for improving predictions of species distri-

butions (Swatantran et al., 2012); however, these data are

simply not available for large spatial extents.

Despite these advances, there are several limitations to be

considered in current uses of remote-sensing data (Turner

et al., 2003; Bradley & Fleishman, 2008) for species distribu-

tion modelling. First, aggregation of raw remote-sensing data

into discrete land-cover classes results in misclassification

errors (Foody, 2002; Gottschalk et al., 2005; Gillespie et al.,

2008). Perhaps more importantly, substantial information is

lost when continuous data on the earth’s reflectance are clas-

sified into discrete categories (e.g. coniferous forest, urban,

agricultural), limiting the ability of land-cover maps to

improve predictions of species distributions (Bradley & Fle-

ishman, 2008). Further, native species may not respond

directly to such human-defined boundaries (Betts et al.,

2007; Zitske et al., 2011). This problem is particularly critical

in extensive remote areas that are often categorized into a

single classification (e.g. mature boreal forest, uncut Amazo-

nian forest), so little variability in land-cover composition is

present across entire study areas. Finally, classification of raw

remote-sensing data can be expensive. Even when imagery is

freely available, there can be additional costs for processing

software and computer hardware. Importantly, classification

of images is a non-trivial task that requires technical special-

ists trained in the appropriate techniques; this renders the

use of remote-sensing imagery inaccessible to many research-

ers, particularly in developing countries. Although there are

many tools available for simple image transformation includ-

ing R tools, making useful information out of spectral data

cannot be a simple black-box operation.

One potential solution to these difficulties is to use raw

data on reflectance as explanatory variables in species distri-

bution models. Because the original information for each

pixel is retained, such a modelling approach has the potential

to improve both the spatial resolution and accuracy of

predictions and avoids the subjectivity associated with the

choice of classification method and the loss of information

associated with classifying an inherently continuous attribute.

Further, because such data are now freely available at high

temporal resolution for many areas of the earth’s surface

(http://landsat.usgs.gov/Landsat_Search_and_Download.php),

such an approach would enable researchers in regions where

no classified images currently exist to model distributions as

a function of land cover.

Here, we present results of an analysis of relationships

between species occurrences and a set of raw remote-sensing

reflectance variables using data collected as part of a study of

birds in forest landscapes of western Oregon, USA. We

hypothesized that occurrences of several bird species would

be associated with the reflectance values for certain bands.

Spectral values for different bands have been associated with

a variety of landscape characteristics. For example, values for

both red and near-infrared reflectance (bands 3 and 4) are

associated with vegetation features such as green-leaf bio-

mass, leaf area and photosynthetic activity (Tucker, 1979;

Sellers, 1985; Spanner et al., 1990), while middle-infrared

reflectance as recorded by bands 5 and 7 is sensitive to

changes in soil and plant moisture and can provide a mea-

sure of senescent biomass (Rey-Benayas & Pope, 1995; Kerr

& Ostrovsky, 2003). We expected that those bands associated

with canopy structure and biomass would be the most

important predictors of bird distributions. If predictive,

maps based on fine-scale spectral data may be relevant to

fundamental objectives in conservation biology and

2 Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd.

S. M. Shirley et al.

Page 3: Species distribution modelling for the people

landscape ecology including quantifying species-specific rates

of habitat loss, forest management planning, protected areas

design (Possingham et al., 2006) and testing the relative

importance of climate change versus habitat loss at broad

spatial scales.

METHODS

Study area

We compiled data from previous studies in two regions at

elevations ranging from sea level to 968 m in north-western

(McGarigal & McComb, 1995) and south-western Oregon,

USA (Alexander et al., 2004). The north-western Oregon

study area is situated in the Drift Creek, Lobster Creek and

Nestucca River basins covering approximately 6500 km2 of

the central Oregon Coast Range and is characterized by wet

winters and dry summers. The south-western Oregon study

area covers a wide range of ecological types ranging from

Cascade slopes to much warmer and drier Klamath Moun-

tains in areas of the Cascade-Siskiyou National Monument

and Applegate Valley. Land cover is dominated by forest pri-

marily composed of Douglas-fir (Pseudotsuga menziesii) and

mountain hemlock (Tsuga mertensiana) with deciduous

species such as red alder (Alnus rubra) and bigleaf maple

(Acer macrophyllum) in the north and oaks (mostly Quercus

garryana and Q. kelloggii), ponderosa pine (Pinus ponderosa)

and Pacific madrone (Arbutus menziesii) at low elevations in

the southern region. Disturbances are primarily from clearcut

timber harvesting in the north-western and clearcut harvest-

ing and wildfires in the south-western study areas.

Bird sampling

In the north-western region, sample points were located in a

uniform grid at 200-m intervals along transects spaced

400 m apart in 30 sub-basins (approximately 300 ha each).

Between 32 and 38 sample points were placed within each

sub-basin, for a total of 505 points. In the south-western

region, 4093 sample points separated by distances of

150–250 m were established along transects to capture eleva-

tion, forest habitat type and disturbance gradients. In both

study areas, avian data were collected using fixed-radius

point counts (Ralph, 1995) conducted between 05:30 and

10:00 Pacific Standard Time at each sample point. In north-

western Oregon, four 8-minute counts were conducted on

separate occasions between May and June in the years from

1990 to 1992. Surveys were conducted 15–20 min before

sunrise to 4 h after sunrise. South-western region point-

count data were collected from 2002 to 2005 using the same

protocols, but points were only visited one to two times dur-

ing the season. Differences in the number of visits between

studies and count duration should result in higher overall

probability of detection for the same species in north-western

than south-western Oregon (Drapeau et al., 1999). However,

all other things being equal, the shape of the response of

individual species to landscape structure should remain the

same (Betts & Villard, 2009). In our analysis, we utilized data

for all birds seen or heard within a 50 m radius. Because

mean bird counts per station tended to be low (< 2) for

most species and we were interested in estimating probability

of occurrence in a way that is consistent with other species

distribution modelling efforts (e.g. Guisan & Thuiller, 2005;

Buermann et al., 2008), we reduced relative abundance data

to presence–absence data for use in binomial models. These

combined datasets resulted in a database of 4598 spatial

sample points, with 127,164 bird detections representing 152

species. For analysis, we selected the 40 most commonly

occurring species (see Table S1 in Supporting Information).

Remote-sensing data

Landsat images corresponding to the survey year were used

for spectral analysis. All the Landsat images were acquired

from USGS (http://landsat7.usgs.gov/index.php). Images

from USGS had sub-pixel geolocation accuracies, and no fur-

ther geometric processing was applied. Although all the

images were chosen to represent the growing season (July

and August in the year of data origin), factors such as sea-

sonal phenology and atmospheric conditions etc. can make

the spectral values recorded by satellite highly variable across

images. For studies such as ours that cover a large geographi-

cal area that may have been collected over several years,

radiometric correction is essential to minimize the noise not

related to ground conditions (Song & Woodcock, 2003;

Schroeder et al., 2006). The cosine–Theta (COST) correction

of Chavez (1996) was used to remove most atmospheric

effects for a single reference image in a given Landsat time

series (LTS). Spectral values for dark objects used in the

COST calculation were based on visual assessment following

the rules outlined in protocols found in Kennedy et al.

(2007a). For each Landsat path/row used in this analysis, one

cloud-free clear image was selected from the Landsat archive

as the reference image. All the other images used were then

normalized to the COST image using the MADCAL (multi-

variate alteration detection and calibration) algorithms of

Canty et al. (2004), which identify stable pixels within a

small subset of the larger image. Clouds and cloud shadow

within each image were identified using Vegetation Change

Tracker of Huang et al. (2010). These pixels were excluded

from further analysis. For each survey point, spectral values

from Landsat bands excluding the thermal infrared band

were summarized using a 150, 500, 1000 and 2000 m radius.

Within the surrounding neighbourhood of each radius, the

mean and standard deviation for the 6 spectral bands were

summarized to produce the spectral variables for the model.

Data analysis

We developed models for 40 of the most commonly occur-

ring bird species in the dataset (based on � 10% occur-

rence). Of these species, 34 species occurred in the northern

Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd. 3

Predicting bird distributions with remote-sensing data

Page 4: Species distribution modelling for the people

region and all 40 species occurred in the southern region.

Our explanatory variables were the means and standard devi-

ations calculated for Landsat remote-sensing reflectance

bands 1, 2, 3, 4, 5 and 7 (hereafter ‘Reflectance Models’). In

addition, we chose to analyse these predictor variables at 4

scales that have been found to be relevant to passerine bird

species: 150, 500, 1000 and 2000 m (Betts et al., 2006) for a

total of 48 predictor variables. Other species distribution

studies commonly use an index derived from bands 3 and 4

known as the Normalized Difference Vegetation Index

(NDVI) (Parra et al., 2004; Gottschalk et al., 2005; Gillespie

et al., 2008; Morán-Ordónez et al., 2012). As a further com-

parison, we developed models for all species using a separate

set of covariates that included the NDVI values calculated at

each scale (hereafter ‘NDVI models’).

We analysed relationships between species distributions of

birds and remote-sensing reflectance using boosted regression

tree (BRT) models (Friedman et al., 2000; Friedman, 2001).

BRT models belong to a class of ensemble or model-averag-

ing models and their use is relatively recent for ecologists

(Leathwick et al., 2006). Unlike traditional regression-based

models that fit a single model using a response variable and

a set of predictors, BRT models start with a simple classifica-

tion or regression tree and use a boosting algorithm to itera-

tively fit new trees to the model in a forward stage-wise

fashion (Elith et al., 2008). Subsequent trees give additional

emphasis to observations poorly predicted by previous trees.

The final BRT model contains many individual model terms

each represented by a tree. BRTs offer a number of advanta-

ges for model fitting. They can be fitted to a variety of

response types including normal, count and binomial data,

and they have the ability to model nonlinear relationships

and interactions between predictor variables. Finally, the

process of boosting has a stochastic component that

improves prediction performance using a random subset of

data to fit each new tree, thereby resulting in slightly differ-

ent final models every time they are run.

Datasets including several remote-sensing predictors from

the same area tend to be highly correlated (Zimmermann

et al., 2007), and our analyses revealed substantial multicol-

linearity in the predictors. Despite this fact, we conducted

analyses using all predictor variables to retain as much infor-

mation as possible to develop relationships and avoid omit-

ting important variables. Correlated input variables are not

problematic for prediction tasks as long as the correlation

structure remains constant over the training and testing

datasets (Dormann et al., 2012). Boosted regression trees do

not encounter the numerical issues faced when fitting linear

models with collinear variables because trees are fit with

recursive partitioning algorithms instead of matrix inversions

(Breiman et al., 1984).

When fitting models with many variables, it is also impor-

tant to avoid overfitting. Overfitting occurs when a model

fits not only the signal, but also the noise in the training

data, resulting in optimistically biased estimates of model

performance on the training data. More complex models

(with more variables) will often perform better on the train-

ing set because they can fit the data more closely. To

compare models of differing complexity fairly, we report per-

formance for all models on an independent test set com-

prised of roughly half of the data points. If overfitting

occurred, we would expect more complex models to perform

more poorly than simpler models on these independent test

data.

We calculated the Moran’s I statistic as a gauge for the

degree to which spatial autocorrelation in residuals of all

BRT models influenced our results. All values were calculated

using the correlog function in the R package ncf (Bjørnstad,

2009). The average Moran’s I value for all species was 0.146,

and although this was a significant correlation (at

alpha = 0.05) in our permutation tests, the effect size was

not substantial (Moran’s I ranges from 0 to 1, with val-

ues > 0.3 considered relatively large (Lichstein et al., 2002)

(see Table S2 in Supporting Information). To minimize the

effects of spatial correlation between the training and test

sets, we used a checkerboard approach similar to those

described in Hochachka et al. (2010), Munson et al. (2010)

and Hochachka et al. (2012). We placed a 15 9 24 checker-

board over the data, resulting in grid cells of roughly

33181.5 NORTHING by 10871.39 EASTING units. The 2395

points in the white squares became the training set, and the

2203 points in the black squares became the test set.

All BRT models were fitted in R version 2.13.1 (www.R-

project.org, R Development Core Team, 2012). We supple-

mented the ‘gbm’ package (Ridgeway, 2006) with additional

functions relevant to ecological data (Elith et al., 2008

Appendix). Values for several model parameters were

selected based on information in Elith et al. (2008). The

degree of model stochasticity is controlled by the ‘bag frac-

tion’ that determines the proportion of the data to be

selected as a random subset from the full training set for

each new tree. Following Elith et al. (2008), we used a bag

fraction of 0.5 which has yielded good results for presence–

absence data. The learning rate, which controls the rate at

which additional trees are added to the model, we set to

0.01. Lower values of the learning rate increase the number

of trees required such that each tree has lower individual

influence giving an overall better fit to the model (Friedman,

2001). Tree complexity refers to the number of nodes or

decision rules and controls the level of model interactions

between predictor variables; this was set to 5 in our models.

We assessed the optimal number of trees for each species

using a tenfold cross-validation procedure (Hastie et al.,

2001) where 9/10 of the training data are used for model fit-

ting, 1/10 is withheld for validation, and the process is

repeated ten times. We evaluated prediction performance of

the models using the area under the receiver operating char-

acteristic curve (AUC) on the independent test set, for which

we computed 95% DeLong confidence intervals using the

‘pROC’ R package (Robin et al., 2011).

Using the gbm package, we also calculated the relative

influence of each predictor; this provides a measure of the

4 Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd.

S. M. Shirley et al.

Page 5: Species distribution modelling for the people

strength of each variable’s influence on the total response

and is reflected as a proportion. We summarized the relative

influence of each predictor variable on the model for the

reflectance models and compared the relative influence of

predictor variables across the six bands and four scales using

a Kruskal–Wallis analysis of variance for non-normally

distributed data.

RESULTS

Species distribution models

Prediction success for the reflectance models was high, with

an average AUC value across all species of 0.79 (SD = 0.10)

and ranging from 0.56 to 0.98 (Fig. 1). Prediction accuracies

for 80% of species were above AUC = 0.70, the value consid-

ered necessary for models to have acceptable discriminatory

power (Hosmer & Lemeshow, 2000) with almost 40% of

models having excellent prediction success (AUC � 0.80).

Several of the species with the highest prediction values were

associated with old-growth forest (e.g. Pacific-Slope Fly-

catcher; Fig. 1) or have highly disjunct distributions between

northern and southern areas. This raised the possibility that

high model performance was partly due to differences in

reflectance between study areas. To some extent, these regio-

nal differences in reflectance represent an ecological signal

on which the model can, and should, make distinctions.

However, for species that only occurred in the southern

region, including the absences from the northern region with

different characteristics might make the prediction task

easier. Our interest was in testing whether predictions were

accurate at fine resolutions rather than simply distinguishing

regional differences. To investigate this issue, we ran trained

and tested models on just the data from the southern region

for the six species that never occurred in the northern

region. The average reduction in AUC over these six species

from the original models to the southern region models was

0.03 (standard deviation 0.02; Shirley and Betts unpublished

data). This small difference, which may be partially attribut-

able to the reduced size of the dataset, indicates that regional

differences in reflectance are not a key driver of prediction

accuracy in our study. There was also no correlation between

prevalence and model performance; relatively less common

species performed as well as more common species

(t = �0.668, P = 0.508).

Compared with the reflectance models, the NDVI models

had a significantly lower prediction success (Mann–Whitney

test = 2157.00, P � 0.001) with an average AUC value

across all species of 0.70 (SD = 0.13) and ranging from 0.48

to 0.95. Prediction accuracies for 37% of species were above

AUC = 0.70, the value considered necessary for models to

have acceptable discriminatory power (Hosmer & Leme-

show, 2000) with 20% of models having excellent prediction

success (AUC � 0.80). The superior performance of the

reflectance models over the NDVI models could be due to

information in the bands not used to produce NDVI. Alter-

natively, higher performance in band-only models could

result from additional information in bands 3 and 4 that

are lost when the data are classified to NDVI. To test these

hypotheses, we developed models using only a subset of

Figure 1 The results of species distribution models showing the area under the curve (AUC � CI) for boosted regression tree models

for the 40 most commonly occurring species based on covariates including all spectral reflectances (filled circles) and NDVI index values

(open circles). The dashed line indicates the predicted accuracy of the null model. See Table S1 for full species names.

Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd. 5

Predicting bird distributions with remote-sensing data

Page 6: Species distribution modelling for the people

predictors (bands 3 and 4 only [16 variables]); the predic-

tion accuracies of the subset models were very similar to

those of the larger reflectance models (AUC = 0.78) even

with substantially fewer predictor variables. These results

indicate that the NDVI models do not contain as much

information as the raw data from bands 3 and 4 and that

while there is some benefit to including the other 4 bands

in the model, most of the predictive performance rests on

bands 3 and 4.

Relative influence of predictors on model

performance

We summarized the relative influence of predictors by bands

and scale. The average contribution of each reflectance band

to model predictions varied substantially (Fig. 2a, H = 82.89,

n = 40, P � 0.001). Across all species, the relative influence

of band 4 predictors was over three times greater than the

next highest band. The mean from band 4 generally had

greater influence than the standard deviation (65% vs. 35%).

In particular, the average band 4 reflectance at the 2000-m

scale was selected as a top five predictor variable for 60% of

species. The spatial scales we considered as predictors also

differed substantially in their contribution to prediction suc-

cess (Fig. 2b, H = 73.70, n = 40, P � 0.001); the average

influence for predictors at the 2000-m scale was 4.95 times

higher than the next scale. All 40 species had at least one top

five predictor at the 2000-m scale, while predictors at the

other scales had influence for roughly half of the species’

models. Considering each species individually, the relative

influence of predictor variables was fairly even in magnitude

across the predictors. However, for 7/40 species, especially

those with strong associations to old-growth conifer forest,

the range of relative influence values was much more diverse

(e.g. relative influence of both high and low magnitude),

with one predictor (band 4 mean at 2000 m) being the

dominant influence.

DISCUSSION

Use of raw reflectance data for prediction accuracy

The use of remote sensing by ecologists is growing as its

potential for addressing the consequences of large-scale envi-

ronmental change on biodiversity increases. In the last few

decades, a number of remote-sensing technologies have been

used in a wide variety of applications including identifying

habitat relationships (Johnson et al., 1998), quantifying bio-

diversity (Kerr et al., 2001; Levin et al., 2007), modelling

species distributions (Achard et al., 2002; Buermann et al.,

2008) and conservation planning (Rey-Benayas & Pope,

1995; Friedlander et al., 2007). One review compiled infor-

mation on over 120 studies using remote-sensing technology

for analysing avian habitat relationships (Gottschalk et al.,

2005). Despite the benefits offered by remote-sensing tech-

nology, the associated costs and requirement for expertise

preclude their use in many ecological studies. Many types of

spectral data are technically difficult to collect and analyse,

have resolutions that are too coarse to use for fine-scale

land-cover patterns (e.g. MODIS, 250–500 m) or have been

classified into maps that are only available for specific peri-

ods; for example, the US National Land Cover Database

covers large spatial scales, but it is only available for the

years 1992, 2001 and 2006.

We demonstrate that variables obtained from raw unclassi-

fied remotely sensed imagery can be used to produce species

distribution models with high prediction ability and for a

wider temporal range than would be possible for classified

imagery products (Fig. 3). The overall prediction accuracy of

similar studies of avian habitat relationships using a classifi-

cation process varies between 60 and 99% with a mean of

85% (Gottschalk et al., 2005); however, actual classification

accuracy is often lower than expected or unknown due to

the high cost associated with assessment (Shao & Wu, 2008).

In contrast, using raw imagery avoids uncertainty associated

Figure 2 Mean (� SE) relative influence (%) of Landsat TM band covariates across all species grouped by (a) reflectance band and (b)

remote-sensing reflectance scale for boosted regression tree models. The values reflect the number of times a variable is selected for

splitting and weighted by improvements to the model (Friedman, 2001). Higher values indicate a stronger influence on the response

variable.

6 Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd.

S. M. Shirley et al.

Page 7: Species distribution modelling for the people

with a loss of classification accuracy due to errors such as

the misclassification of habitat types to pixels (White et al.,

1997; Shao & Wu, 2008) as well as the omission of fine-scale

local features (e.g. Gottschalk et al., 2005) and subtle changes

in vegetation. In addition, the use of all spectral reflectance

bands as independent predictors as opposed to indices that

focus on a narrow range of the spectrum may improve pre-

dictions by allowing models to select from a wider set of

band combinations (Morán-Ordóñez et al., 2012). The fine-

scale spectral resolution of raw remote-sensing data used in

our study may contribute to the high values of prediction

accuracy observed for most species in our analysis.

One potential issue with the use of raw bands is the

potential error due to topographical ‘wild areas’, where sha-

dow effects occur due to topographical complexities in slope

and aspect. It is interesting to note that despite this potential

error, our models performed well on independent data.

However, it is also possible that because wild areas are con-

sistently placed across landscapes (e.g. north facing slopes),

these shadows contain biological information that is relevant

to modelling species distributions.

Influence of variables on model prediction

We assessed the relative strength of the reflectance variables

in terms of band and scale of data aggregation. While inter-

pretation of models trained on non-independent input vari-

ables must be carried out with caution, relative influence

metrics can still provide some insight into the inner work-

ings of boosted regression tree models. There has been rela-

tively little analysis of multispecies or communities using

remote-sensing reflectance data (Gottschalk et al., 2005;

Zimmermann et al., 2007). Most studies that have used

reflectance data have used derived vegetation indices (Baret

& Guyot, 1991), NDVI (Gottschalk et al., 2005; Gillespie

et al., 2008) or have been focused on a single species. To our

knowledge, our study is the first to identify some general

patterns in the usefulness of spectral reflectances for species

distribution modelling of multiple bird species within a

region. We found that, across all species, reflectance in band

4 and variables at the scale of 2000 m were the predictors

with the highest relative influence. Band 4 represents reflec-

tance in the near-infrared region of the visible spectrum

(Kerr & Ostrovsky, 2003) and is associated with vegetation

biomass and photosynthetic activity (Sellers, 1985; Spanner

et al., 1990). In addition, in central America, the band 4

radiance was the most important predictor for determining

landscape diversity patterns (Rey-Benayas & Pope, 1995).

Variables derived from satellite images (NDVI-based pre-

dictors) measured at coarser scales have also been shown to

be important for species richness of both boreal plants and

butterflies (Parviainen et al., 2009; Levanoni et al., 2011).

The reasons for these patterns are not well understood, but

it is thought that variables estimated over larger scales better

capture landscape-scale processes such as patch colonization

and habitat selection. The largest of the spatial extents we

(a) (b) (c) (d)

Figure 3 Predicted potential spatial distribution in western Oregon of the Olive-sided Flycatcher (Contopus cooperi) [(a) and (b)] and

Pacific-slope Flycatcher (Empidonax difficilis) [(c) and (d)] in 1995 and 2005 estimated from boosted regression tree models using

Landsat TM spectral reflectance bands as predictor variables. Colours refer to probability of occurrence where yellow indicates the

highest probability and black the lowest probability. In this example, mean reflectance for band 4 at 150 and 2000 m was the predictors

with the largest relative influence for the Olive-sided Flycatcher and Pacific-slope Flycatcher, respectively.

Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd. 7

Predicting bird distributions with remote-sensing data

Page 8: Species distribution modelling for the people

selected likely include the scale relevant to bird species in

natal dispersal (Bowman, 2003) and extraterritorial move-

ments during the breeding season (Norris & Stutchbury,

2001). In a previous effort from Oregon, the 2000 m spatial

extent also emerged as the most important spatial scale

(Betts et al., 2010). Importantly, this study used different

independent data (land-cover classes) and modelled distribu-

tions separately for southern and northern study areas.

Response predictor variables summarized as mean values had

the greatest overall relative influence, although measures of

variation were also important in some instances. This sug-

gests that for many species, the average vegetation composi-

tion around the sample point was more important than the

heterogeneity in vegetation cover.

Applications of models using raw remote-sensing

data

Unclassified Landsat data offer several advantages for use in

species distributions models. Landsat 7 imagery provides data

at a relatively fine spatial resolution of 900 m2 (30 9 30 m),

but these data are available at global spatial extents, so the

typical trade-off between resolution and extent of spatial data

(Betts et al., 2006) is not necessary. Further, images are now

free to the public and easy to obtain. These images are avail-

able on an annual basis offering opportunities for ecologists

to easily access multi-year data over global scales. Indeed,

remote-sensing data are a necessary and sometimes the only

source of land-cover data for many regions of the world. For

example, remote mountainous areas of high conservation

interest can have low economic value in terms of resource

availability, but provide important ecological services; they

are reservoirs of water and biological diversity (Harrison

et al., 2010). Recent developments in remote-sensing process-

ing using open source software such as Quantum GIS

(http://www.qgis.org/) and GRASS GIS (http://grass.osgeo.

org) have made the process of radiometric correction more

accessible and reduced subjectivity associated with visual

assessments to identify dark objects. Using raw remote-sens-

ing technology allows us to develop urgently needed esti-

mates of species distributions in a cost-effective manner and

is critical to the management and conservation of species

over large scales.

The need for fine-scale resolution data for predicting

potential influences of land-cover change on species distribu-

tions is only starting to be fully appreciated. Land-cover pro-

cesses such as forest management, agricultural and urban

development usually happen incrementally at fine resolutions

(e.g. most clearcuts in temperate zones are < 100 ha (McGa-

rigal & McComb, 1995; Betts et al., 2003); changes in spatial

patterns occurring as a result of these processes could not be

detected using resolutions > 1 km (e.g. MODIS) until the

majority of the habitat had been removed (Balmford et al.,

2003). Small-bodied species likely respond to the landscape at

fine resolutions, for example, individual canopy gaps (McGa-

rigal & McComb, 1995; Betts et al., 2006), which could be

missed with data collected at broad spatial scales. Further,

fine-scale spatial data are needed to predict the precise loca-

tion of a species and its association. For example, ‘fine-filter’

strategies for biodiversity conservation (Scott et al., 1993)

often require identification of sites at scales much finer than

those permissible by coarser remotely sensed data such as

MODIS. Recent advances in remote-sensing technology using

very high-resolution LiDAR in combination with other satel-

lite imagery have shown high prediction success from being

able to produce multidimensional habitat structure maps

(Swatantran et al., 2012); however, LiDAR data are rarely

available at large spatial extents. Our approach of using raw

annual imagery also allows for a fine-scale modelling of spe-

cies distributions as illustrated in Fig. 3 without having to be

constrained by the temporal ranges associated with classified

imagery products (e.g. NLCD 1992 and 2001).

The use of raw reflectances in modelling species distribu-

tions may be limited to certain applications. Unlike detailed

quantification of vegetation structure and composition avail-

able from ground-sampled plots or classified imagery prod-

ucts, raw reflectances are not easily interpretable by ecologists

which may limit their ecological relevance in uncovering spe-

cies–vegetation associations (Ollinger, 2011). Because climate,

soil conditions and vegetation structure may affect the rela-

tionships between bird occurrences and Landsat reflectances,

specific models developed for one image are not necessarily

easily applied to other images (Nagendra, 2001). As in most

instances where species distribution models are applied

beyond the spatial and temporal extents of the existing data,

predictions should be carefully evaluated and used with an

appropriate degree of caution. Despite these limitations, we

conclude that raw remote-sensing data show great promise

for strengthening applications that have previously made use

of remote-sensing data such as forest management planning

(Achard et al., 2002) and habitat change (Kerr & Ostrovsky,

2003; Wolter et al., 2008), identification of biological hotspots

at fine resolutions, and reserve selection (Schulman et al.,

2007). Most importantly, these data are freely obtained, rela-

tively easy to use and available for many areas of the globe.

ACKNOWLEDGEMENTS

This study was supported by a US National Science Founda-

tion grant (NSF-ARC-0941748) to MGB and grant

G11AC20255 from the United States Geological Survey

(USGS). Thanks to Tom Dietterich for helpful conversations.

The authors are grateful to Janet Franklin, Benjamin Zucker-

berg and two anonymous reviewers who provided helpful

comments on this manuscript.

REFERENCES

Achard, F., Eva, H.D., Stibig, H.J., Mayaux, P., Gallego, J.,

Richards, T. & Malingreau, J.P. (2002) Determination of

deforestation rates of the world’s humid tropical forests.

Science, 297, 999–1002.

8 Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd.

S. M. Shirley et al.

Page 9: Species distribution modelling for the people

Alexander, J.D., Ralph, C.J., Hollinger, K. & Hogoboom, B.

(2004) Using a wide-scale landbird monitoring network to

determine landbird distribution and productivity in the

Klamath Bioregion. Proceedings of the Second Conference on

Klamath-Siskiyou Ecology (ed. by K.L. Mergenthaler, J.E.

Williams, and J. Jules), pp. 33–41. Siskiyou Field Institute,

Cave Junction, Oregon.

Allan, J.D. (2004) Landscapes and riverscapes: the influence

of land use on stream ecosystems. Annual Review of Ecology

Evolution and Systematics, 35, 257–284.

Balmford, A., Green, R.E. & Jenkins, M. (2003) Measuring

the changing state of nature. Trends in Ecology & Evolution,

18, 326–330.

Baret, F. & Guyot, G. (1991) Potentials and limits of vegeta-

tion indexes for Lai and Apar assessment. Remote Sensing

of Environment, 35, 161–173.

Betts, M.G. & Villard, M.A. (2009) Landscape thresholds in

species occurrence as quantitative targets for forest

management: generality in space and time? Setting conser-

vation targets for managed forest landscapes (ed. by M.A.

Villard), pp. 185–206. Cambridge University Press,

Cambridge, UK.

Betts, M.G., Franklin, S.E. & Taylor, R.G. (2003) Interpreta-

tion of landscape pattern and habitat change for local

indicator species using satellite imagery and geographic

information system data in New Brunswick, Canada.

Canadian Journal of Forest Research, 33, 1821–1831.

Betts, M.G., Diamond, A.W., Forbes, G.J., Villard, M.A. &

Gunn, J.S. (2006) The importance of spatial autocorrela-

tion, extent and resolution in predicting forest bird

occurrence. Ecological Modelling, 191, 197–224.

Betts, M.G., Forbes, G.J. & Diamond, A.W. (2007) Thresh-

olds in songbird occurrence in relation to landscape

structure. Conservation Biology, 21, 1046–1058.

Betts, M.G., Hagar, J.C., Rivers, J.W., Alexander, J.D., McGa-

rigal, K. & McComb, B.C. (2010) Thresholds in forest bird

occurrence as a function of the amount of early-seral

broadleaf forest at landscape scales. Ecological Applications,

20, 2116–2130.

Bino, G., Levin, N., Darawshi, S., Van Der Hal, N., Reich-

Solomon, A. & Kark, S. (2008) Accurate prediction of bird

species richness patterns in an urban environment using

Landsat-derived NDVI and spectral unmixing. International

Journal of Remote Sensing, 29, 3675–3700.

Bjørnstad, O.N. (2009) ncf: spatial nonparametric covariance

functions. Documentation on the R Package ‘ncf’, version

1.1-3 http://cran.stat.sfu.ca/ (accessed 23 May 2012).

Bowman, J. (2003) Is dispersal distance of birds propor-

tional to territory size? Canadian Journal of Zoology, 81,

195–202.

Bowman, J., Forbes, G. & Dilworth, T. (2001) Landscape

context and small-mammal abundance in a managed for-

est. Forest Ecology and Management, 140, 249–255.

Bradley, B.A. & Fleishman, E. (2008) Can remote sensing of

land cover improve species distribution modelling? Journal

of Biogeography, 35, 1158–1159.

Breiman, L., Friedman, J.H., Olshen, R.A. & Stone, C.J.

(1984) Classification and regression trees. Wadsworth &

Brooks/Cole Advanced Books & Software, Monterey, CA.

Buermann, W., Saatchi, S., Smith, T.B., Zutta, B.R., Chaves,

J.A., Mila, B. & Graham, C.H. (2008) Predicting species

distributions across the Amazonian and Andean regions

using remote sensing data. Journal of Biogeography, 35,

1160–1176.

Canty, M.J., Nielsen, A.A. & Schmidt, M. (2004) Automatic

radiometric normalization of multitemporal satellite imag-

ery. Remote Sensing of Environment, 91, 441–451.

Chavez, P.S. (1996) Image-based atmospheric corrections

revisited and improved. Photogrammetric Engineering and

Remote Sensing, 62, 1025–1036.

Coops, N.C., Wulder, M.A. & Iwanicka, D. (2009) Exploring

the relative importance of satellite-derived descriptors of

production, topography and land cover for predicting

breeding bird species richness over Ontario, Canada.

Remote Sensing of Environment, 113, 668–679.

Dale, V.H. (1997) The relationship between land-use change

and climate change. Ecological Applications, 7, 753–769.

Dickinson, R.E. (1991) Global change and terrestrial hydrol-

ogy - a review. Tellus, 43, 176–181.

Dormann, C.F., Elith, J., Bacher, S., Buchmann, C., Carl, G.,

Carr�e, G., Garcia Marqu�ez, J.R., Gruber, B., Lafourcade, B.,

Leit~ao, P.J., M€unkem€uller, T., McClean, C., Osborne, P.E.,

Reineking, B., Schr€oder, B., Skidmore, A.K., Zurell, D. &

Lautenbach, S. (2012) Collinearity: a review of methods to

deal with it and a simulation study evaluating their perfor-

mance. Ecography, 35, 1–20.

Drapeau, P., Leduc, A. & McNeil, R. (1999) Refining the use

of point counts at the scale of individual points in studies

of bird-habitat relationships. Journal of Avian Biology, 30,

367–382.

Elith, J., Leathwick, J.R. & Hastie, T. (2008) A working guide

to boosted regression trees. Journal of Animal Ecology, 77,

802–813.

Foody, G.M. (2002) Status of land cover classification accu-

racy assessment. Remote Sensing of Environment, 80, 185–

201.

Friedlander, A.M., Brown, E.K. & Monaco, M.E. (2007)

Coupling ecology and GIS to evaluate efficacy of marine

protected areas in Hawaii. Ecological Applications, 17,

715–730.

Friedman, J.H. (2001) Greedy function approximation:

a gradient boosting machine. Annals of Statistics, 29, 1189–

1232.

Friedman, J., Hastie, T. & Tibshirani, R. (2000) Additive

logistic regression: a statistical view of boosting. Annals of

Statistics, 28, 337–374.

Gillespie, T.W., Foody, G.M., Rocchini, D., Giorgi, A.P. &

Saatchi, S. (2008) Measuring and modelling biodiversity

from space. Progress in Physical Geography, 32, 203–

221.

Gottschalk, T.K., Huettmann, F. & Ehlers, M. (2005) Thirty

years of analysing and modelling avian habitat relationships

Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd. 9

Predicting bird distributions with remote-sensing data

Page 10: Species distribution modelling for the people

using satellite imagery data: a review. International Journal

of Remote Sensing, 26, 2631–2656.

Guisan, A. & Thuiller, W. (2005) Predicting species distribu-

tion: offering more than simple habitat models. Ecology

Letters, 8, 993–1009.

Harrison, P.A., Vandewalle, M., Sykes, M.T., Berry, P.M.,

Bugter, R., de Bello, F., Feld, C.K., Grandin, U., Harring-

ton, R., Haslett, J.R., Jongman, R.H.G., Luck, G.W., da

Silva, P.M., Moora, M., Settele, J., Sousa, J.P. & Zobel, M.

(2010) Identifying and prioritising services in European

terrestrial and freshwater ecosystems. Biodiversity and Con-

servation, 19, 2791–2821.

Hastie, T., Tibshirani, R. & Friedman, J.H. (2001) The ele-

ments of statistical learning: data mining, inference, and pre-

diction. Springer-Verlag, New York.

Hochachka, W.M., Fink, D. & Kelling, S. (2010) Checklist

programs as a source of data for bird monitoring:

designing analyses and model validations to account for

unequal spatial and temporal sampling effort. Proceedings

of the 18th Conference of the European Bird Census Coun-

cil (ed. by A. Anselin), pp. 9–20. Bird Census News, Cac-

eres, Spain.

Hochachka, W.M., Fink, D. & Zuckerberg, B. (2012) Use of

citizen-science monitoring for pattern discovery and bio-

logical inference. Design and analysis of long-term ecological

monitoring studies (ed. by R.A. Gitzen, A.B. Cooper, J.J.

Millspaugh and D.S. Licth), pp. 460–477. Cambridge Uni-

versity Press, Cambridge, UK.

Hosmer, D.W. & Lemeshow, S. (2000) Applied logistic

regression, 2nd edn. John Wiley and Sons, New York,

USA.

Huang, C.Q., Coward, S.N., Masek, J.G., Thomas, N., Zhu,

Z.L. & Vogelmann, J.E. (2010) An automated approach for

reconstructing recent forest disturbance history using dense

Landsat time series stacks. Remote Sensing of Environment,

114, 183–198.

Jetz, W., Wilcove, D.S. & Dobson, A.P. (2007) Projected

impacts of climate and land-use change on the global

diversity of birds. PloS Biology, 5, 1211–1219.

Johnson, D.D.P., Hay, S.I. & Rogers, D.J. (1998) Contempo-

rary environmental correlates of endemic bird areas derived

from meteorological satellite sensors. Proceedings of the

Royal Society of London Series B, 265, 951–959.

Kennedy, R.E., Cohen, W.B., Kirschbaum, A.A. & Haunreit-

er, E. (2007a). Protocol for Landsat-based monitoring of

landscape dynamics at North Coast and Cascades Network

parks. US Geological Survey Techniques and Methods 2-G1

Reston, VA, USA.

Kerr, J.T. & Ostrovsky, M. (2003) From space to species:

ecological applications for remote sensing. Trends in Ecol-

ogy & Evolution, 18, 299–305.

Kerr, J.T., Southwood, T.R.E. & Cihlar, J. (2001) Remo-

tely-sensed habitat diversity predicts butterfly species

richness and community similarity in Canada. Proceedings

of the National Academy of Sciences USA, 98, 11365–

11370.

Leathwick, J.R., Elith, J., Francis, M.P., Hastie, T. & Taylor,

P. (2006) Variation in demersal fish species richness in the

oceans surrounding New Zealand: an analysis using

boosted regression trees. Marine Ecology-Progress Series,

321, 267–281.

Lemoine, N., Bauer, H.G., Peintinger, M. & B€ohning-Gaese,

K. (2007) Effects of climate and land-use change on species

abundance in a central European bird community. Conser-

vation Biology, 21, 495–503.

Levanoni, O., Levin, N., Pe’er, G., Turbe, A. & Kark, S.

(2011) Can we predict butterfly diversity along an elevation

gradient from space? Ecography, 34, 372–383.

Levin, N., Shmida, A., Levanoni, O., Tamari, H. & Kark, S.

(2007) Predicting mountain plant richness and rarity from

space using satellite-derived vegetation indices. Diversity

and Distributions, 13, 692–703.

Lichstein, J.W., Simons, T.R., Shriner, S.A. & Franzreb, K.E.

(2002) Spatial autocorrelation and autoregressive models in

ecology. Ecological Monographs, 72, 445–463.

McGarigal, K. & McComb, W.C. (1995) Relationships

between landscape structure and breeding birds in the Ore-

gon Coast Range. Ecological Monographs, 65, 235–260.

Mitchell, M.S., Lancia, R.A. & Gerwin, J.A. (2001) Using

landscape-level data to predict the distribution of birds on

a managed forest: effects of scale. Ecological Applications,

11, 1692–1708.

Mladenoff, D.J., Sickley, T.A. & Wydeven, A.P. (1999) Pre-

dicting gray wolf landscape recolonization: logistic regres-

sion models vs. new field data. Ecological Applications, 9,

37–44.

Mor�an-Ord�o~nez, A., Su�arez-Seoane, S., Elith, J., Calvo, L. &

de Luis, E. (2012) Satellite surface reflectance improves

habitat distribution mapping: a case study on heath and

shrub formation in the Cantabrian Mountains (NW

Spain). Diversity and Distributions, 18, 588–602.

Munson, M.A., Caruana, R., Fink, D., Hochachka, W.M.,

Iliff, M., Rosenberg, K.V., Sheldon, D., Sullivan, B.L.,

Wood, C. & Kelling, S. (2010) A method for measuring

the relative information content of data from different

monitoring protocols. Methods in Ecology and Evolution, 1,

263–273.

Nagendra, H. (2001) Using remote sensing to assess bio-

diversity. International Journal of Remote Sensing, 22, 2377–

2400.

Norris, D.R. & Stutchbury, B.J.M. (2001) Extraterritorial

movements of a forest songbird in a fragmented landscape.

Conservation Biology, 15, 729–736.

Ollinger, S.V. (2011) Sources of variability in canopy reflec-

tance and the convergent properties of plants. New Phytol-

ogist, 189, 375–394.

Opdam, P. & Wascher, D. (2004) Climate change meets hab-

itat fragmentation: linking landscape and biogeographical

scale levels in research and conservation. Biological Conser-

vation, 117, 285–297.

Osborne, P.E., Alonso, J.C. & Bryant, R.G. (2001) Modelling

landscape-scale habitat use using GIS and remote sensing:

10 Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd.

S. M. Shirley et al.

Page 11: Species distribution modelling for the people

a case study with great bustards. Journal of Applied Ecology,

38, 458–471.

Parra, J.L., Graham, C.C. & Freile, J.F. (2004) Evaluating

alternative data sets for ecological niche models of birds in

the Andes. Ecography, 27, 350–360.

Parviainen, M., Luoto, M. & Heikkinen, R.K. (2009) The role

of local and landscape level measures of greenness in mod-

elling boreal plant species richness. Ecological Modelling,

220, 2690–2701.

Possingham, H.P., Wilson, K.A., Andelman, S.J. & Vynne,

C.H. (2006) Protected areas: goals, limitations, and design.

Principles of Conservation Biology (ed. by M.J. Groom, G.K.

Meffe and C.R. Carroll), pp. 509–533. Sinauer Associates

Inc., Sunderland, MA.

R Development Core Team. (2012). R: a language and

environment for statistical computing. R Foundation for Sta-

tistical Computing, Vienna, Austria. http://www.R-project.

org.

Ralph, C.J. (1995). Monitoring bird populations by point

counts. General Technical Report PSW-GTR-149, United

States Department of Agriculture Forest Service, Pacific

Southwest Research Station.

Rey-Benayas, J.M. & Pope, K.O. (1995) Landscape ecology

and diversity patterns in the seasonal tropics from landsat

TM imagery. Ecological Applications, 5, 386–394.

Ridgeway, G. (2006) gbm: generalized boosted regression

models. Documentation on the R Package ‘gbm’, version

1.5-6 http://cran.r project.org/web/packages/gbm/gbm.pdf

(accessed 23 May 2012).

Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F.,

Sanchez, J.C. & Muller, M. (2011) pROC: an open-source

package for R and S plus to analyze and compare ROC

curves. BMC Bioinformatics, 12, 1–8.

Roura-Pascual, N., Suarez, A.V., McNyset, K., G�omez, C.,

Pons, P., Touyama, Y., Wild, A.L., Gascon, F. & Peterson,

A.T. (2006) Niche differentiation and fine-scale projections

for Argentine ants based on remotely sensed data. Ecologi-

cal Applications, 16, 1832–1841.

Schroeder, T.A., Cohen, W.B., Song, C.H., Canty, M.J. &

Yang, Z.Q. (2006) Radiometric correction of multi-tempo-

ral Landsat data for characterization of early successional

forest patterns in western Oregon. Remote Sensing of Envi-

ronment, 103, 16–26.

Schulman, L., Ruokolainen, K., Junikka, L., S€a€aksj€arvi, I.E.,

Salo, M., Juvonen, S.K., Salo, J. & Higgins, M. (2007)

Amazonian biodiversity and protected areas: do they meet?

Biodiversity and Conservation, 16, 3011–3051.

Scott, J.M., Davis, F., Csuti, B., Noss, R., Butterfield, B.,

Groves, C., Anderson, H., Caicco, S., Derchia, F., Edwards,

T.C., Ulliman, J. & Wright, R.G. (1993) Gap analysis - a

geographic approach to protection of biological diversity.

Wildlife Monographs, 123, 1–41.

Sellers, P.J. (1985) Canopy reflectance, photosynthesis and

transpiration. International Journal of Remote Sensing, 6,

1335–1372.

Shao, G.F. & Wu, J.G. (2008) On the accuracy of landscape

pattern analysis using remote sensing data. Landscape Ecol-

ogy, 23, 505–511.

Soberon, J. (2007) Grinnellian and Eltonian niches and geo-

graphic distributions of species. Ecology Letters, 10, 1115–

1123.

Song, C.H. & Woodcock, C.E. (2003) Monitoring forest suc-

cession with multitemporal Landsat images: factors of

uncertainty. IEEE Transactions on Geoscience and Remote

Sensing, 41, 2557–2567.

Spanner, M.A., Pierce, L.L., Peterson, D.L. & Running, S.W.

(1990) Remote-sensing of temperate coniferous forest leaf-

area index - the influence of canopy closure, understory

vegetation and background reflectance. International Jour-

nal of Remote Sensing, 11, 95–111.

Swatantran, A., Dubayah, R., Goetz, S., Hofton, M., Betts,

M.G., Sun, M., Simard, M. & Holmes, R. (2012) Mapping

migratory bird prevalence using remote sensing data

fusion. PLoS One, 7, e28922.

Thuiller, W., Albert, C., Ara�ujo, M.B., Berry, P.M., Cabeza,

M., Guisan, A., Hickler, T., Midgely, G.F., Paterson, J.,

Schurr, F.M., Sykes, M.T. & Zimmermann, N.E. (2008)

Predicting global change impacts on plant species’ distribu-

tions: future challenges. Perspectives in Plant Ecology Evolu-

tion and Systematics, 9, 137–152.

Tucker, C.J. (1979) Red and photographic infrared linear

combinations for monitoring vegetation. Remote Sensing of

Environment, 8, 127–150.

Turner, W., Spector, S., Gardiner, N., Fladeland, M., Sterling,

E. & Steininger, M. (2003) Remote sensing for biodiversity

science and conservation. Trends in Ecology & Evolution,

18, 306–314.

Venier, L.A., Pearce, J., McKee, J.E., McKenney, D.W. & Ni-

emi, G.J. (2004) Climate and satellite-derived land cover

for predicting breeding bird distribution in the Great Lakes

Basin. Journal of Biogeography, 31, 315–331.

White, D., Minotti, P.G., Barczak, M.J., Sifneos, J.C., Free-

mark, K.E., Santelmann, M.V., Steinitz, C.F., Kiester, A.R.

& Preston, E.M. (1997) Assessing risks to biodiversity from

future landscape change. Conservation Biology, 11, 349–360.

Wilcove, D.S., Rothstein, D., Dubow, J., Phillips, A. & Losos,

E. (1998) Quantifying threats to imperiled species in the

United States. BioScience, 48, 607–615.

Wolter, P.T., Townsend, P.A., Sturtevant, B.R. & Kingdon,

C.C. (2008) Remote sensing of the distribution and abun-

dance of host species for spruce budworm in Northern

Minnesota and Ontario. Remote Sensing of Environment,

112, 3971–3982.

Zimmermann, N.E., Edwards, T.C., Moisen, G.G., Frescino,

T.S. & Blackard, J.A. (2007) Remote sensing-based predic-

tors improve distribution models of rare, early successional

and broadleaf tree species in Utah. Journal of Applied

Ecology, 44, 1057–1067.

Zinner, D., Pelaez, F. & Torkler, F. (2002) Distribution and

habitat of grivet monkeys (Cercopithecus aethiops aethiops)

Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd. 11

Predicting bird distributions with remote-sensing data

Page 12: Species distribution modelling for the people

in eastern and central Eritrea. African Journal of Ecology,

40, 151–158.

Zitske, B.P., Betts, M.G. & Diamond, A.W. (2011) Negative

effects of habitat loss on survival of migrant warblers in a

forest mosaic. Conservation Biology, 25, 993–1001.

SUPPORTING INFORMATION

Additional Supporting Information may be found in the

online version of this article:

Table S1 List of bird species used to assess the usefulness of

raw unclassified remote-sensing data in predicting species

distributions in Oregon, USA.

Table S2 Results of the Moran’s I test for each bird species

used to assess spatial autocorrelation in the residuals for BRT

models for predicting species distributions in Oregon, USA.

BIOSKETCH

Susan Shirley and Matthew Betts, based in the Depart-

ment of Forest Ecosystems and Society at Oregon State

University (http://www.fsl.orst.edu/flel/index.htm), lead a

research team directed at understanding climatic and land-

use factors underlying the spatial and temporal distribution

of bird species in North America.

Author contributions: S.S. and M.G.B. conceived the ideas; S.S.,

Z.Y., K.M., J.A. and M.G.B. collected the data; S.S., M.G.B. and

R.H. analysed the data; and S.S. and M.G.B. led the writing.

Editor: Janet Franklin

12 Diversity and Distributions, 1–12, ª 2013 John Wiley & Sons Ltd.

S. M. Shirley et al.