data needs and challenges for the present and future of ... · data needs and challenges for the...
TRANSCRIPT
![Page 1: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/1.jpg)
Data Needs and Challenges for the Present
and Future of Species Distribution Modelling
Robert P. Anderson, City University of New York, USA
Miguel Araujo, Museo Nacional de Ciencias Naturales, Spain
Antoine Guisan, University of Lausanne, Switzerland
Jorge M. Lobo, Museo Nacional de Ciencias Naturales, Spain
Enrique Martinez-Meyer, Universidad Nacional Autonoma de Mexico, Mexico
A. Townsend Peterson, University of Kansas, USA
Jorge Soberon, University of Kansas, USA – Group leader
with support from Dmitry Schigel, GBIF Secretariat
Task Group on GBIF Data Fitness for Use in Distribution Modeling
![Page 2: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/2.jpg)
Biodiversity data quality symposium
Sao Paulo, March 2016
A Shift from Quantity to Quality in Primary Biodiversity
Data
Jorge Miguel Lobo
Dpto. Biogeography and Global Change
Nacional de Ciencias Naturales
C.S.I.C. Madrid.
![Page 3: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/3.jpg)
$T$T
$T
$T
$T
$T$T
$T
$T$T
$T$T
$T
$T
$T
$T
$T $T$T$T
$T
$T
$T
$T
$T
$T
$T $T$T
$T$T
$T
$T
$T$T$T$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T$T
$T
$T
$T$T$T$T
$T
$T
$T$T
$T$T
$T$T
$T$T
$T
$T$T$T$T$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T$T
$T$T
$T$T
$T$T
$T$T
$T$T$T$T
$T$T
$T
$T
$T
$T$T
$T
$T$T
$T$T
$T
$T
$T
$T
$T $T$T$T
$T
$T
$T
$T
$T
$T
$T $T$T
$T$T
$T
$T
$T$T$T$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T$T
$T
$T
$T$T$T$T
$T
$T
$T$T
$T$T
$T$T
$T$T
$T
$T$T$T$T$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T
$T$T
$T$T
$T$T
$T$T
$T$T
$T$T$T$T
Geog
raph
ic s
pace
Occurrence dataof species
Modeling algorithm(GLM, GAM, MaxEnt, GARP, Bioclim, etc)
Temperature
Moi
stur
e
Niche model
Predicted potential distribution
Output
Eco
logi
cal sp
ace
Environmentalinformation
Projecting backto geography
Input data
SDM / ENM is a correlative analytical approach meant to infer the ecological conditions under which a species live and to represent them geographically
What is Species Distribution Modeling (or Ecological Niche Modeling)?
![Page 4: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/4.jpg)
SDM has become very popular in the last two decades
Rodríguez-Castañeda et al. 2012. PlosOne 7(9): e44402
![Page 5: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/5.jpg)
In part because of its proven usefulness for addressing questions in a variety of fields, in both basic and applied science, such as biogeography, ecology, evolution, conservation biology, and public health, among others
Rodríguez-Castañeda et al. 2012. Plos One 7(9): e44402
![Page 6: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/6.jpg)
Williams et al. 2009. Div & Dist. 15: 565-576Raxworthy et al. 2003. Nature. 426: 837-841
SDM has been successfully used:
In biological survey and exploration
![Page 7: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/7.jpg)
In Taxonomy
Rissler & Apodaca. 2007. Syst Biol. 56: 924-942
Combining genetic and ecological information to strengthen the criteria to delimit cryptic species
![Page 8: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/8.jpg)
In Conservation Biology
Araiza et al. 2012. Cons Biol 26: 630-637
For reintroducing extinct species in the wild
333.6
km2
283.7
km2
385.8
km2
270
km2
381.6
km2
![Page 9: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/9.jpg)
Waltari et al. 2007. Plos One e563
In Biogeography
To test biogeographycal hypotheses
![Page 10: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/10.jpg)
Martínez-Meyer et al. 2004. GEB 13: 305-314
To investigate species’ responses to climate change in the past
Vole (Phenacomys intermedius)
In Climate Change Biology
![Page 11: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/11.jpg)
Identificación de corredores
Williams et al. 2005. Cons. Biol. 19: 1063-1074
Áreas nuevas
Áreas nuevas irremplazables
Áreas flexibles
Identificación de nuevas áreas importantes para la conservación
And proactive actions for the future
In Climate Change Biology
![Page 12: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/12.jpg)
Also because data to produce models are widely available thorough the internet for thousands of species all over the world, and software for modeling and analyses are also freely available, and more and more user-friendly
![Page 13: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/13.jpg)
Lozier et al. 2009. J.Biogeog. 36: 1623-1627
This is generally a good
news, nonetheless the
handiness of data and ease
of use of modeling programs
have caused a euphoria for
the use of SDM, in many
cases ignoring its
assumptions and reaches,
thus abusing the approach,
overselling and degrading the
scientific rigor of the field
![Page 14: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/14.jpg)
Data needs for building SDM
--
-
-
-
-
Presence & absenceIdeally, to characterize the
ecological niche of the
species, we would need data
of conditions that species
like (occurrence data) and
conditions that species
DOES NOT like (absence
data). However, there are
few or no databases
accounting for the absence
of species
![Page 15: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/15.jpg)
?
?
??
?
?
??
?
?
?
??
?
?
?
?
?
??
?
? ?
??
?
??
?
?
?
?
?
?
?
Data needs for building SDM
--
-
-
-
-
Presence/absence
(e.g. ANN)
Presence-only
(e.g. BIOCLIM)
Presence/pseudo-
absence
(e.g. GARP)
?
?
??
?
?
??
?
?
?
??
?
?
?
?
?
??
?
? ?
??
?
??
?
??
?
??
?
?
?
Presence/background
(e.g. Maxent)
![Page 16: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/16.jpg)
There are several sources of PBD
GBIF is the largest platform of data and with the widest taxonomic and geographic coverage with ≈650 million records of +1.6 million species
Monitoring
Specialized Literature Scientific collections &
HerbariaDistributed databases
![Page 17: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/17.jpg)
However, occurrence primary biodiversity data (PBD) are subject to errors and biases from different sources
![Page 18: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/18.jpg)
The two main errors in PBD for SDM are:
Taxonomic
When the identity of the species in the record is wrong
Brown Howler Monkey(Alouatta palliata)
Black Howler Monkey(Alouatta pigra)
![Page 19: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/19.jpg)
Geographic
When the position of the record is wrong
Myocastor coypus
8 mi. S of San Marcial, Tucumán, Argentina
The two main errors in PBD for SDM are:
![Page 20: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/20.jpg)
There are other important biases of biodiversity data for SDM
Geographic/ecologic bias
Geographic/taxonomic/temporal completeness
![Page 21: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/21.jpg)
Errors and biases have an important impact in SD models: Effect of data uncertainty
Thomomys bottae
By John Wieczorek
![Page 22: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/22.jpg)
Uncertaintyradius:3.3 + 6.3 kmn = 2624
T. bottae
Occurrences and uncertainty areas
![Page 23: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/23.jpg)
Effect of uncertaintyModel not accounting for uncertainty
100 kmT. bottae
Uncertainty does have an effect in SDM and has the potential to mislead interpretation and conclusions
![Page 24: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/24.jpg)
As a consequence, an important amount of data downloaded from GBIF is currently being discarded
for analyses
From a survey by the DM-GBIF Task Group to 136 experts
![Page 25: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/25.jpg)
Data quality makes it all in SDMModeling algorithms are quite powerful and they can model anything. But they are dumb: Algorithms do not make decisions, only follow orders
GLM Model
Maxent Model
We make decisions, so we are responsible for model outputs and interpretation
![Page 26: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/26.jpg)
The Future of SDM and data requirements
Kearney & Porter. 2009. Ecol. Letters 12: 334-350
Coupling with other approaches that demand an integration of different sorts if data: physiological data
![Page 27: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/27.jpg)
y = 49.107x-1.9078
R2 = 0.5745
P < 0.001
n = 48
0
10
20
30
40
50
60
0 2 4 6 8 10
Distance to centroid
Ab
un
dan
ce (
ind
/ro
ute
)
© Dick Cannings
The Future of SDM and data requirements
Coupling with other approaches that demand an integration of different data realms: population data
Martínez-Meyer et al. 2013. Biol Lett 9:20120637
![Page 28: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/28.jpg)
Anderson et al. 2009. Proc.Roy.Soc.B 276:1415-1420
The Future of SDM and data requirements
Coupling with other approaches that demand an integration of different data realms: dispersal data
![Page 29: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/29.jpg)
Araújo & Loto. 2007. GEB 16: 743-753
The Future of SDM and data requirements
Coupling with other approaches that demand an integration of different data realms: interaction networks
![Page 30: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/30.jpg)
Challenges
Occurrence data is the cornerstone of SDM. Quantity –but mainly quality– of biodiversity data is key for producing reliable models. In their current form at least three issues limit a faster development of SDM:
1.Data issues: Inaccuracies (Geographic, Taxonomic and Temporal errors and biases)
2.Accessibility issues: Full information of records is often missing or hard to obtain due to inefficient functionality of data processing
3.Use issues: Careless or inappropriate use of data, underproficient knowledge of the taxonomic group, data and/or modeling methods
Stakeholders of PBD (i.e., data providers, data aggregators and data users) need to have clear their role and duties in PDB circulation and use to address and overcome these challenges, as well as a more fluent communication among them
![Page 31: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/31.jpg)
ChallengesSDM progress is seriously hindered by:
a. The biases and gaps in the available biological information
b. The lack of reliable absence data and the use of background or pseudo-absences as an alternative
c. The use of sophisticated and complex modelling methods able to model the own errors and biases in the response variables
d. The absence of an agreed “gold-standard” procedure to validate model outputs.
e. The lack of information about the survey effort carried in each locality
f. The lack of discrimination between poorly and relatively well-surveyed localities
g. Our incapacity to know the location of the minimum number of localities required to generate reliable interpolations to recover the full spectrum of environmental conditions in a territory
![Page 32: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/32.jpg)
the GBIF strategy the SDM strategy
GBIFdata
SDMsModel
validationGaps identification
• To provide a “seal of quality” for the data
• An agreement on the criteria determining data quality.
A Proposal
![Page 33: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/33.jpg)
Conclusions
SDM / ENM is a popular research field still in expansion and with fertile grounds to generate basic and applied knowledge
The present and future of SDM is quite exciting but demands a major effort from all stakeholders to make PBD more abundant, more accessible and more reliable
Integration of PBD with other data realms opens new veins of opportunities to take biodiversity analyses to a next level, thus development of standards and protocols of interconnection with other sources of data are needed, as well as new and creative methodologies for their analysis
![Page 34: Data Needs and Challenges for the Present and Future of ... · Data Needs and Challenges for the Present and Future of Species Distribution Modelling Robert P. Anderson, City University](https://reader033.vdocuments.us/reader033/viewer/2022050223/5f68d0bbd57a7c48542aa4dc/html5/thumbnails/34.jpg)
Many thanks and sorry for not being able to attend this meeting