geographic potential of disease caused by ebola and...

11
Acta Tropica 162 (2016) 114–124 Contents lists available at ScienceDirect Acta Tropica journal homepage: www.elsevier.com/locate/actatropica Geographic potential of disease caused by Ebola and Marburg viruses in Africa A. Townsend Peterson a,, Abdallah M. Samy a,b,a Biodiversity Institute, The University of Kansas, Lawrence, KS, 66045, USA b Faculty of Science, Ain Shams University, Abbassia, Cairo, 11566, Egypt article info Article history: Received 12 February 2016 Received in revised form 4 June 2016 Accepted 10 June 2016 Available online 14 June 2016 Keywords: Zaire ebolavirus Taï Forest ebolavirus Sudan ebolavirus Marburg Africa Transmission Ecological niche abstract Filoviruses represent a significant public health threat worldwide. West Africa recently experienced the largest-scale and most complex filovirus outbreak yet known, which underlines the need for a predictive understanding of the geographic distribution and potential for transmission to humans of these viruses. Here, we used ecological niche modeling techniques to understand the relationship between known filovirus occurrences and environmental characteristics. Our study derived a picture of the potential transmission geography of Ebola virus species and Marburg, paired with views of the spatial uncer- tainty associated with model-to-model variation in our predictions. We found that filovirus species have diverged ecologically, but only three species are sufficiently well known that models could be developed with significant predictive power. We quantified uncertainty in predictions, assessed potential for out- breaks outside of known transmission areas, and highlighted the Ethiopian Highlands and scattered areas across East Africa as additional potentially unrecognized transmission areas. © 2016 Elsevier B.V. All rights reserved. 1. Introduction Since 1976, scattered human cases and outbreaks have been documented of hemorrhagic fever caused by viruses of the family Filoviridae, caused by the five known Ebola species and the related Marburg viruses (Groseth et al., 2007), which are non-segmented, negative-stranded RNA viruses. These viruses are believed to be hosted in the long term by fruit bats (family Pteropodidae), although full clarity in this issue is largely lacking: that is, solid causal evidence is accumulating regarding the bat Rousettus aegyp- tiacus as a reservoir for Marburg virus (Amman et al., 2012; Amman et al., 2015; Towner et al., 2009), and a tie of infections in humans to exposure to mines and caves is clear (Peterson et al., 2006). How- ever, evidence regarding the reservoir of Ebola virus is less clear (Pigott et al., 2015). Although compelling temporal and anecdotal links have been pointed out (Leroy et al., 2009), serological evidence has painted a more complex picture, with detections of viruses in multiple bat species, and in regions and under ecological conditions where particular viruses have never been documented (Hayman et al., 2012; Pourrut et al., 2009). Most recently, Ebola surprised the Corresponding author. E-mail addresses: [email protected] (A.T. Peterson), [email protected], [email protected] (A.M. Samy). world community with an emergence in Guinea—quite apart from the magnitude of the outbreak, Zaire ebolavirus was unknown in West Africa, as the virus that would have been expected in Guinea was Taï Forest ebolavirus (Bausch and Schwarz, 2014). The West African Zaire ebolavirus outbreak underlines the need for a predictive understanding of the geographic distribution of these viruses and their potential for transmission to humans across Africa. Although studies on filoviruses almost invariably include a table (e.g., Chippaux, 2014) and/or map (e.g., Polonsky et al., 2014) of known outbreaks, only four studies have gone beyond the occur- rences to interpolate or estimate a full potential distribution: early analyses that explored the basic idea (Peterson et al., 2004, 2006) and a recent pair of assessments that took advantage of an addi- tional decade of accumulation of occurrence data (Pigott et al., 2014, 2015). The older analyses are rather dated, in terms of both the occurrence information and the quality of the environmental data and tools for analysis; the new studies, on the other hand, have a number of shortcomings in methodology, which are treated in detail in the Discussion. This contribution aims to present a more comprehensive view of the geographic potential of Ebola and Marburg viruses known to infect humans, to offer an up-to-date and rigorous view of where these viruses may be found. Specifically, we (1) tested for niche divergence between Ebola species, and consequently treated Ebola species separately in modeling efforts; (2) we developed analyses http://dx.doi.org/10.1016/j.actatropica.2016.06.012 0001-706X/© 2016 Elsevier B.V. All rights reserved.

Upload: others

Post on 10-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

Acta Tropica 162 (2016) 114ndash124

Contents lists available at ScienceDirect

Acta Tropica

journa l homepage wwwe lsev ier com locate ac ta t ropica

Geographic potential of disease caused by Ebola and Marburg virusesin Africa

A Townsend Peterson alowast Abdallah M Samy ablowast

a Biodiversity Institute The University of Kansas Lawrence KS 66045 USAb Faculty of Science Ain Shams University Abbassia Cairo 11566 Egypt

a r t i c l e i n f o

Article historyReceived 12 February 2016Received in revised form 4 June 2016Accepted 10 June 2016Available online 14 June 2016

KeywordsZaire ebolavirusTaiuml Forest ebolavirusSudan ebolavirusMarburgAfricaTransmissionEcological niche

a b s t r a c t

Filoviruses represent a significant public health threat worldwide West Africa recently experienced thelargest-scale and most complex filovirus outbreak yet known which underlines the need for a predictiveunderstanding of the geographic distribution and potential for transmission to humans of these virusesHere we used ecological niche modeling techniques to understand the relationship between knownfilovirus occurrences and environmental characteristics Our study derived a picture of the potentialtransmission geography of Ebola virus species and Marburg paired with views of the spatial uncer-tainty associated with model-to-model variation in our predictions We found that filovirus species havediverged ecologically but only three species are sufficiently well known that models could be developedwith significant predictive power We quantified uncertainty in predictions assessed potential for out-breaks outside of known transmission areas and highlighted the Ethiopian Highlands and scattered areasacross East Africa as additional potentially unrecognized transmission areas

copy 2016 Elsevier BV All rights reserved

1 Introduction

Since 1976 scattered human cases and outbreaks have beendocumented of hemorrhagic fever caused by viruses of the familyFiloviridae caused by the five known Ebola species and the relatedMarburg viruses (Groseth et al 2007) which are non-segmentednegative-stranded RNA viruses These viruses are believed tobe hosted in the long term by fruit bats (family Pteropodidae)although full clarity in this issue is largely lacking that is solidcausal evidence is accumulating regarding the bat Rousettus aegyp-tiacus as a reservoir for Marburg virus (Amman et al 2012 Ammanet al 2015 Towner et al 2009) and a tie of infections in humans toexposure to mines and caves is clear (Peterson et al 2006) How-ever evidence regarding the reservoir of Ebola virus is less clear(Pigott et al 2015) Although compelling temporal and anecdotallinks have been pointed out (Leroy et al 2009) serological evidencehas painted a more complex picture with detections of viruses inmultiple bat species and in regions and under ecological conditionswhere particular viruses have never been documented (Haymanet al 2012 Pourrut et al 2009) Most recently Ebola surprised the

lowast Corresponding authorE-mail addresses townkuedu (AT Peterson) samykuedu

samysciasuedueg (AM Samy)

world community with an emergence in Guineamdashquite apart fromthe magnitude of the outbreak Zaire ebolavirus was unknown inWest Africa as the virus that would have been expected in Guineawas Taiuml Forest ebolavirus (Bausch and Schwarz 2014)

The West African Zaire ebolavirus outbreak underlines the needfor a predictive understanding of the geographic distribution ofthese viruses and their potential for transmission to humans acrossAfrica Although studies on filoviruses almost invariably include atable (eg Chippaux 2014) andor map (eg Polonsky et al 2014)of known outbreaks only four studies have gone beyond the occur-rences to interpolate or estimate a full potential distribution earlyanalyses that explored the basic idea (Peterson et al 2004 2006)and a recent pair of assessments that took advantage of an addi-tional decade of accumulation of occurrence data (Pigott et al 20142015) The older analyses are rather dated in terms of both theoccurrence information and the quality of the environmental dataand tools for analysis the new studies on the other hand havea number of shortcomings in methodology which are treated indetail in the Discussion

This contribution aims to present a more comprehensive viewof the geographic potential of Ebola and Marburg viruses known toinfect humans to offer an up-to-date and rigorous view of wherethese viruses may be found Specifically we (1) tested for nichedivergence between Ebola species and consequently treated Ebolaspecies separately in modeling efforts (2) we developed analyses

httpdxdoiorg101016jactatropica2016060120001-706Xcopy 2016 Elsevier BV All rights reserved

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 115

for Marburg virus as well as Ebola (3) we assessed and addressedpseudoreplication and its effects on model predictions and (4) weaddressed uncertainty in our model outputs In our model calibra-tion efforts we took care to control for accessibility of areas butwe also assessed the possibility of long-distance jumpsmdasheffectivelylsquosurprisesrsquo akin to Zaire ebolavirus appearing in West Africa in 2014(Bausch and Schwarz 2014)

2 Materials and methods

This paper presents a variety of niche model-based analysesof filovirus (ie Ebola and Marburg) potential distributions acrossAfrica We excluded two filovirus taxa from consideration for lackof sufficient occurrence information (Cuevavirus is known fromone site in Spain only) or any occurrence information whatsoever(transmission of Reston ebolavirus from its natural reservoir is notknown from any definable location such that no geographic infor-mation is available to us) We are fully cognizant that occurrenceinformation is likely not sufficient to characterize potential distri-butions of two additional species (Bundibugyo ebolavirus and TaiumlForest ebolavirus) but develop preliminary models and associateduncertainty estimates specifically to quantify and demonstrate thisinsufficiency

We focused on developing predictions of potential filovirusdistributions but we also carefully avoided overinterpreting ourresults That is we tested niche differentiation before we aggre-gated multiple species (only Zaire ebolavirus and Sudan ebolaviruswere amenable to testing) Analyzing species separately we wereable to develop analyses for three species (Zaire ebolavirus Sudanebolavirus Marburg) but not for the less-well-known Taiuml Forestebolavirus and Bundibugyo ebolavirus We have incorporated severalmethodological innovations and precautions that are recognized inthe biodiversity realm (eg model calibration within an accessiblearea Peterson et al 2011) but that have not been adopted widelyenough in work to date in the public health field (Peterson 2014b)

21 Input data

We accumulated occurrence data for this study from diversesources including our earlier compilation (Peterson et al 2004Peterson et al 2006) and numerous subsequent compilations(Bausch and Schwarz 2014 Changula et al 2014 Chippaux 2014Leroy et al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy2014) we were unable to use the data set published in associa-tion with the recent mapping exercise (Mylne et al 2014 Pigottet al 2015) because only the human occurrence data were pub-lished for Ebola animal detection data used in analyses have notto our knowledge been made available to the broader community(we did not request them of the authors of those studies however)

As an independent source however we reviewed all posts in theProMed archives (httpwwwpromedmailorg queries executed1 November 2014) that included reference to ldquoEbolardquo ldquoMarburgrdquo orldquofilovirusrdquo we included only occurrences cited as ldquoconfirmedrdquo andalways verified records against independent information sourcesWe omitted occurrences detected serologically in bats in view ofthe rather odd patterns that such detections have shownmdashwitnesseg the detection of Marburg in the Congo Basin (Towner et al2007) an entire biome where it has never been found to occur oth-erwise evaluation of effects of including bat detection data in nichemodels was complicated by the fact that the bat detection data havenot been made available to the broader community in anythingother than summary form (Mylne et al 2014 Pigott et al 2014)but we would not incorporate them into analyses until considerablymore clarity was available regarding their meaning

An ideal spatialenvironmental analysis of this sort would eitherincorporate or eliminate effects of spatial autocorrelation of sam-pling in occurrence data for such models (de Oliveira et al 2014Veloz 2009) For two reasons however we did not take such stepsFirst the usual concern is that the spatial autocorrelation is insampling rather than in the occurrence of the phenomenon itselfif independent occurrences of the phenomenon in question (egfilovirus transmission from reservoir to humans) are spatially auto-correlated by their nature (as we suspect is the case with filovirusoccurrences wherein all occurrences of transmission at least tohumans should be detected and reported) the effects are per-haps of less concern Second and more practically sample sizeseven for the three relatively well-characterized filovirus species arenonetheless still small such that any reduction to remove autocor-relation would be crippling

For each occurrence in the final list we used Internet-based elec-tronic gazetteers to add geographic coordinates to the data record(httpwwwfallingraincom) We assigned a rough estimate ofthe uncertainty associated with each data record (entering a cavewas assigned 200 m uncertainty a person in a village was assigned5ndash10 km a general description of a region over which a humanwas infected was assigned 50ndash170 km depending on the density ofother major landmarks or political regions Fig 1) This data set isdeposited and fully available in an institutional online repository(httphdlhandlenet180821024)

A further key element in developing ecological niche models arehypotheses of areas (M) that have been accessible to the speciesover relevant time periods (Barve et al 2011) these areas are ineffect hypotheses of relevant areas that exclude areas never sam-pled by the species in question In light of the minimal informationavailable about the natural history and biogeography of filoviruseswe created rather general M hypotheses as 7 (sim770 km) buffersaround known occurrences of each species but then modifiedthose areas in light of known biogeographic barriers (eg lowlandsaround and northwest of Lake Turkana in northern Kenya separat-ing humid areas of the Ethiopian Highlands from areas to the southand west see Fig 1)

We were concerned about the broad spatial autocorrelation pat-terns in climate data which lead to rather general models that donot hold much spatial detail as was the case in our earlier filovirusmodels (Peterson et al 2004) At the same time we resisted thetemptation to use the more-detailed data now available from sen-sors aboard the MODIS satellite because known filovirus outbreaksdate back as far as 1976 using newer imagery would likely lead toinappropriate environmental signals in view of the large-scale landuse changes that have affected Africa

As a consequence we used the older Normalized Difference Veg-etation Index (NDVI) data from the AVHRR satellite (James andKalluri 1994) as an environmental data set in this study (1 kmspatial resolution) We used 12 monthly composite NDVI data lay-ers (downloaded with atmospheric corrections already completedfrom UMD 2001) corresponding to February 1995ndashJanuary 1996to capture aspects of land cover and seasonality These data lay-ers correspond approximately to the midpoint of the time span offilovirus occurrence data used on model developmentmdashalthoughthis lsquotime-averagedrsquo approach to temporal consistency betweenoccurrence data and environmental data is not satisfactory (egland-use change before or after the date of the environmentaldata is not taken into account in analyses) a robust lsquotime-specificrsquomethodology is not as-yet available in ecological niche modeling(Peterson et al 2005) We inspected each layer for artifacts of pro-cessing and concatenated them into lsquoraster stacksrsquo for analysis inQGIS (version 24) we then used principal components analysis(PCA) to (1) reduce numbers of dimensions and (2) create orthogo-nal (ie uncorrelated) highly informative dimensions for analysisPCA has the added advantage of creating individual components

116 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 1 Summary of known occurrences of five filovirus species across Africa with associated uncertainty values These data correspond to the filovirus occurrence data setdescribed in the Methods and made openly available to the scientific community

that respond (at least roughly) to major axes of interest such aslatitude seasonality etc we believe that the high informationcontent of principal component axes combined with their orthogo-nality more than offsets the loss of interpretability associated withcomposite axes (Peterson 2014b) For the initial suite of ecologicalniche models we fit PCAs within M hypotheses for each speciesto be analyzed as M is the only region of importance to the actualdistribution of a species however for the final analysis assessingcontinentwide patterns of suitability and the potential for unex-pected emergence events of filovirus species (see below) we useda single PCA over all of Africa and used the M areas to clip out datasets for analysis so that the environmental data would be on thesame axes between the restricted and continentwide datasets

22 Model calibration

We initially evaluated a series of algorithms for use in this studyparticularly those facilitated by the openModeller platform (deSouza Munoz et al 2011) but we settled on Maxent version 333k(Phillips et al 2006) in light of the convenient bootstrapping fea-tures which allowed us to build in dimensions of uncertainty inour models We used the jackknife option to identify variables notcontributing importantly to model robustness (ie extrinsic gain)

Once a reduced set of variables was identified we conducted 10replicate analyses for each species based on a 50 bootstrap ofavailable occurrence data we used the median logistic output ofreplicate analyses as an appropriate summary although this met-ric has been interpreted in the form of a probability of occurrence(Phillips et al 2006) a more conservative interpretation is as ametric of suitability only (Peterson 2014b) which we follow hereAlthough we much appreciate the importance of thresholding eco-logical niche models to avoid overfitting and overinterpretation ofmodel outputs (Peterson et al 2007) in light of the small samplesizes available to us for all of the species of interest we generallyretained continuous model outputs for interpretation and visual-ization of suitability patterns

23 Assess niche similarity between species

We tested for departure from niche similarity using the back-ground similarity test developed by Warren et al (2008) whichassesses the similarity or difference of two niche estimates in rela-tion to similarity among large numbers of null replicate modelsIn effect the test assesses whether two niche estimates are moredifferent than one would expect given the lsquobackground similar-ity of the two accessible regions For this test we focused on the

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 117

Fig 2 Effects of pseudoreplication on model outputs for Zaire ebolavirus The middle map shows the difference between the logistic modeled suitability scores from modelsbased on pseudoreplicated data mimicking recent modeling efforts (Pigott et al 2014) the upper and lower panels show pixels (in black) in the western and eastern portionsof this speciesrsquo known distribution in which the pseudoreplicated model presented values out of the range observed in 25 replicate analyses of non-pseudoreplicated databased on random points cast across the uncertainty area for each point

118 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

two Ebola species for which reasonable numbers of occurrenceswere available (Zaire ebolavirus with N = 14 and Sudan ebolaviruswith N = 6) We did not develop such tests for Taiuml Forest ebolavirusand Bundibugyo ebolavirus or for the two likely-distinct species ofMarburg (Peterson and Holder 2012) which would have much-reduced statistical power owing to low sample sizes

Random background points were generated from the accessiblearea for each species in numbers equal to numbers of real occur-rence data available for each species with 100 replicate samplesand with the least training presence thresholding option specifiedThese points were used to generate niche models in Maxent andbackground models were compared with the real-species modelin each replicate two such comparisons were developed testingeach of the species against the background of the other We calcu-lated Hellingerrsquos I indices in R (R Development Core Team 2013)for all replicates We used the 5th percentile of each distribution ofbackground similarity values as a critical value in rejecting the nullhypothesis of similarity between the two species

24 Pseudoreplication

In the recently published analyses of Ebola potential distribu-tions (Pigott et al 2014) the authors chose to interpret the manycases of Ebola detected in great apes in Gabon and Republic ofthe Congo (Sleeman 2004) as independent transmission events Toassess the effects of this assumption we compared Table 2 of Pigottet al (2014) to the occurrence data matrix cited above we identi-fied single occurrences in the latter that corresponded to multipleoccurrences in the formermdashindeed among other instances of pseu-doreplication in one case 21 occurrences in Table 2 of Pigott et al(2014) related to a single occurrence in our dataset

Hence we developed models for Zaire ebolavirus with the singlerepresentative (no pseudoreplication) and with multiple represen-tatives (pseudoreplication) To mimic the multiple sites cited forthese pseudoreplicates in Table 2 of Pigott et al (2014) instead ofusing a single coordinate pair we used random points cast withinthe uncertainty radius which covered a particularly broad areaWe then compared the resulting two maps detecting areas over-and under-emphasized by models including pseudoreplication Weidentified areas for which the pseudoreplicated model presentedpixel values outside the range of values among 25 replicate modelsdeveloped without pseudoreplication (see next paragraph) thesesites had pseudoreplication effects on model outputs that wereparticularly pronounced

25 Assess uncertainty

In the case of viral diseases as rare as Ebola and Marburg it is notsufficient simply to estimate suitability rather a crucial elementis assessment of uncertainty in model predictions to the extentpossible Hence to incorporate uncertainty deriving from posi-tional considerations into our models we created 25 sets of randompoints cast one per outbreak within the uncertainty radius of eachoutbreak (see Fig 1) following our previous examples (Nakazawaet al 2010 Peterson et al 2006) We then developed 25 repli-cate analyses following protocols listed above each consisting of 10bootstrap subsampling analyses From the 25 medians of the boot-strap replicates we obtained the maximum and minimum valuesand calculated range = maximum minus minimum as an index by whichto characterize uncertainty Because the range of values generatedin this index will be highly dependent on sample sizes we use thisindex principally for comparisons among areas within a particularspecies and not for comparisons among species

26 Assess possibility of ldquosurprisesrdquo

The known history of Ebola virus transmission events hasalready seen three events in which the viruses appearedunexpectedly Zaire ebolavirus and Sudan ebolavirus appearingsimultaneously in 1976 Reston ebolavirus apparently coming fromthe Philippines and Zaire ebolavirus appearing in Guinea in 2014In each case a virus appeared at a place where it was not expectedAs a consequence an important step is to assess the possibility offurther such distributional novelties

We maintained our focus on model calibration within the acces-sible area M but then transferred model predictions across all ofAfrica which is the appropriate means by which to extend nichemodels across areas of potential (but not realized) distribution out-side of our current concept of the accessible area of each species(Owens et al 2013) That is for this analysis we conducted a princi-pal components analysis across the entire continent but extractedthe portion of Africa within each speciesrsquo individual M area formodel calibration Although these principal components were notas specific the procedure has the advantage of putting the envi-ronmental data for within M and across Africa on the same scaleModels were calibrated within M and then transferred to all ofAfrica We assessed the degree to which model transfers would beextrapolative using the MOP metric (Owens et al 2013)

3 Results

We first assessed ecological niche similarity between Zaireebolavirus and Sudan ebolavirus The two null (background simi-larity) distributions (ie Zaire ebolavirus compared to backgroundmodels for Sudan ebolavirus and vice versa) had ranges of similar-ity values of 076ndash092 and 084ndash098 whereas the observed valuewas 048 As a consequence we rejected the null hypothesis of nichesimilarity in favor of an alternative hypothesis of niche differenceAn important implication of this result is that combining two Ebolaspecies with differentiated niches in analyses is not justified ana-lyzing them together would result in overestimating the breadth ofecological niches

Assessing effects of pseudoreplication of occurrence data onmodel outputs multiple ldquocopiesrdquo of occurrence data influencedmodeled distributional areas rather dramatically Specificallypseudoreplication caused underestimation of suitability of areasin southern Gabon southern Republic of the Congo and south-ern and southeastern Democratic Republic of the Congo (Fig 2)and overestimation of suitability in parts of the northern CongoBasin and southern Cameroon These differences caused by pseu-doreplicating occurrence data were lsquoout of rangersquo with regard to ourmodels incorporating positional uncertainty and thereby indicatesubstantial effects of pseudoreplication in terms of underestimat-ing suitability particularly in the southern and southeastern sectorsof the Congo Basin

Modeled suitability and associated uncertainty for each of the 5African filovirus species are summarized in Figs 3 and 4 Specif-ically Sudan ebolavirus was reconstructed as having a potentialdistribution focused in northwestern Uganda northeastern Demo-cratic Republic of the Congo and southern South Sudan Thesepredictions were particularly unstable however in the DemocraticRepublic of the Congo in contrast areas in Uganda were stablyidentified as presenting high suitability Zaire ebolavirus was mod-eled as having potential distributional areas across the known rangeof the species albeit extending more broadly in Cameroon IvoryCoast and other countries predictions in the lower (western) por-tions of the Congo Basin such as in Gabon Equatorial Guinea andsouthern Cameroon showed relatively high uncertainty

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 119

Fig 3 Modeled suitability for three relatively well-known species of filoviruses Sudan ebolavirus Zaire ebolavirus and Marburg Left-hand column maps show suitabilityvalues with darker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based ondifferent random representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a bluerectangle in an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 2: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 115

for Marburg virus as well as Ebola (3) we assessed and addressedpseudoreplication and its effects on model predictions and (4) weaddressed uncertainty in our model outputs In our model calibra-tion efforts we took care to control for accessibility of areas butwe also assessed the possibility of long-distance jumpsmdasheffectivelylsquosurprisesrsquo akin to Zaire ebolavirus appearing in West Africa in 2014(Bausch and Schwarz 2014)

2 Materials and methods

This paper presents a variety of niche model-based analysesof filovirus (ie Ebola and Marburg) potential distributions acrossAfrica We excluded two filovirus taxa from consideration for lackof sufficient occurrence information (Cuevavirus is known fromone site in Spain only) or any occurrence information whatsoever(transmission of Reston ebolavirus from its natural reservoir is notknown from any definable location such that no geographic infor-mation is available to us) We are fully cognizant that occurrenceinformation is likely not sufficient to characterize potential distri-butions of two additional species (Bundibugyo ebolavirus and TaiumlForest ebolavirus) but develop preliminary models and associateduncertainty estimates specifically to quantify and demonstrate thisinsufficiency

We focused on developing predictions of potential filovirusdistributions but we also carefully avoided overinterpreting ourresults That is we tested niche differentiation before we aggre-gated multiple species (only Zaire ebolavirus and Sudan ebolaviruswere amenable to testing) Analyzing species separately we wereable to develop analyses for three species (Zaire ebolavirus Sudanebolavirus Marburg) but not for the less-well-known Taiuml Forestebolavirus and Bundibugyo ebolavirus We have incorporated severalmethodological innovations and precautions that are recognized inthe biodiversity realm (eg model calibration within an accessiblearea Peterson et al 2011) but that have not been adopted widelyenough in work to date in the public health field (Peterson 2014b)

21 Input data

We accumulated occurrence data for this study from diversesources including our earlier compilation (Peterson et al 2004Peterson et al 2006) and numerous subsequent compilations(Bausch and Schwarz 2014 Changula et al 2014 Chippaux 2014Leroy et al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy2014) we were unable to use the data set published in associa-tion with the recent mapping exercise (Mylne et al 2014 Pigottet al 2015) because only the human occurrence data were pub-lished for Ebola animal detection data used in analyses have notto our knowledge been made available to the broader community(we did not request them of the authors of those studies however)

As an independent source however we reviewed all posts in theProMed archives (httpwwwpromedmailorg queries executed1 November 2014) that included reference to ldquoEbolardquo ldquoMarburgrdquo orldquofilovirusrdquo we included only occurrences cited as ldquoconfirmedrdquo andalways verified records against independent information sourcesWe omitted occurrences detected serologically in bats in view ofthe rather odd patterns that such detections have shownmdashwitnesseg the detection of Marburg in the Congo Basin (Towner et al2007) an entire biome where it has never been found to occur oth-erwise evaluation of effects of including bat detection data in nichemodels was complicated by the fact that the bat detection data havenot been made available to the broader community in anythingother than summary form (Mylne et al 2014 Pigott et al 2014)but we would not incorporate them into analyses until considerablymore clarity was available regarding their meaning

An ideal spatialenvironmental analysis of this sort would eitherincorporate or eliminate effects of spatial autocorrelation of sam-pling in occurrence data for such models (de Oliveira et al 2014Veloz 2009) For two reasons however we did not take such stepsFirst the usual concern is that the spatial autocorrelation is insampling rather than in the occurrence of the phenomenon itselfif independent occurrences of the phenomenon in question (egfilovirus transmission from reservoir to humans) are spatially auto-correlated by their nature (as we suspect is the case with filovirusoccurrences wherein all occurrences of transmission at least tohumans should be detected and reported) the effects are per-haps of less concern Second and more practically sample sizeseven for the three relatively well-characterized filovirus species arenonetheless still small such that any reduction to remove autocor-relation would be crippling

For each occurrence in the final list we used Internet-based elec-tronic gazetteers to add geographic coordinates to the data record(httpwwwfallingraincom) We assigned a rough estimate ofthe uncertainty associated with each data record (entering a cavewas assigned 200 m uncertainty a person in a village was assigned5ndash10 km a general description of a region over which a humanwas infected was assigned 50ndash170 km depending on the density ofother major landmarks or political regions Fig 1) This data set isdeposited and fully available in an institutional online repository(httphdlhandlenet180821024)

A further key element in developing ecological niche models arehypotheses of areas (M) that have been accessible to the speciesover relevant time periods (Barve et al 2011) these areas are ineffect hypotheses of relevant areas that exclude areas never sam-pled by the species in question In light of the minimal informationavailable about the natural history and biogeography of filoviruseswe created rather general M hypotheses as 7 (sim770 km) buffersaround known occurrences of each species but then modifiedthose areas in light of known biogeographic barriers (eg lowlandsaround and northwest of Lake Turkana in northern Kenya separat-ing humid areas of the Ethiopian Highlands from areas to the southand west see Fig 1)

We were concerned about the broad spatial autocorrelation pat-terns in climate data which lead to rather general models that donot hold much spatial detail as was the case in our earlier filovirusmodels (Peterson et al 2004) At the same time we resisted thetemptation to use the more-detailed data now available from sen-sors aboard the MODIS satellite because known filovirus outbreaksdate back as far as 1976 using newer imagery would likely lead toinappropriate environmental signals in view of the large-scale landuse changes that have affected Africa

As a consequence we used the older Normalized Difference Veg-etation Index (NDVI) data from the AVHRR satellite (James andKalluri 1994) as an environmental data set in this study (1 kmspatial resolution) We used 12 monthly composite NDVI data lay-ers (downloaded with atmospheric corrections already completedfrom UMD 2001) corresponding to February 1995ndashJanuary 1996to capture aspects of land cover and seasonality These data lay-ers correspond approximately to the midpoint of the time span offilovirus occurrence data used on model developmentmdashalthoughthis lsquotime-averagedrsquo approach to temporal consistency betweenoccurrence data and environmental data is not satisfactory (egland-use change before or after the date of the environmentaldata is not taken into account in analyses) a robust lsquotime-specificrsquomethodology is not as-yet available in ecological niche modeling(Peterson et al 2005) We inspected each layer for artifacts of pro-cessing and concatenated them into lsquoraster stacksrsquo for analysis inQGIS (version 24) we then used principal components analysis(PCA) to (1) reduce numbers of dimensions and (2) create orthogo-nal (ie uncorrelated) highly informative dimensions for analysisPCA has the added advantage of creating individual components

116 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 1 Summary of known occurrences of five filovirus species across Africa with associated uncertainty values These data correspond to the filovirus occurrence data setdescribed in the Methods and made openly available to the scientific community

that respond (at least roughly) to major axes of interest such aslatitude seasonality etc we believe that the high informationcontent of principal component axes combined with their orthogo-nality more than offsets the loss of interpretability associated withcomposite axes (Peterson 2014b) For the initial suite of ecologicalniche models we fit PCAs within M hypotheses for each speciesto be analyzed as M is the only region of importance to the actualdistribution of a species however for the final analysis assessingcontinentwide patterns of suitability and the potential for unex-pected emergence events of filovirus species (see below) we useda single PCA over all of Africa and used the M areas to clip out datasets for analysis so that the environmental data would be on thesame axes between the restricted and continentwide datasets

22 Model calibration

We initially evaluated a series of algorithms for use in this studyparticularly those facilitated by the openModeller platform (deSouza Munoz et al 2011) but we settled on Maxent version 333k(Phillips et al 2006) in light of the convenient bootstrapping fea-tures which allowed us to build in dimensions of uncertainty inour models We used the jackknife option to identify variables notcontributing importantly to model robustness (ie extrinsic gain)

Once a reduced set of variables was identified we conducted 10replicate analyses for each species based on a 50 bootstrap ofavailable occurrence data we used the median logistic output ofreplicate analyses as an appropriate summary although this met-ric has been interpreted in the form of a probability of occurrence(Phillips et al 2006) a more conservative interpretation is as ametric of suitability only (Peterson 2014b) which we follow hereAlthough we much appreciate the importance of thresholding eco-logical niche models to avoid overfitting and overinterpretation ofmodel outputs (Peterson et al 2007) in light of the small samplesizes available to us for all of the species of interest we generallyretained continuous model outputs for interpretation and visual-ization of suitability patterns

23 Assess niche similarity between species

We tested for departure from niche similarity using the back-ground similarity test developed by Warren et al (2008) whichassesses the similarity or difference of two niche estimates in rela-tion to similarity among large numbers of null replicate modelsIn effect the test assesses whether two niche estimates are moredifferent than one would expect given the lsquobackground similar-ity of the two accessible regions For this test we focused on the

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 117

Fig 2 Effects of pseudoreplication on model outputs for Zaire ebolavirus The middle map shows the difference between the logistic modeled suitability scores from modelsbased on pseudoreplicated data mimicking recent modeling efforts (Pigott et al 2014) the upper and lower panels show pixels (in black) in the western and eastern portionsof this speciesrsquo known distribution in which the pseudoreplicated model presented values out of the range observed in 25 replicate analyses of non-pseudoreplicated databased on random points cast across the uncertainty area for each point

118 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

two Ebola species for which reasonable numbers of occurrenceswere available (Zaire ebolavirus with N = 14 and Sudan ebolaviruswith N = 6) We did not develop such tests for Taiuml Forest ebolavirusand Bundibugyo ebolavirus or for the two likely-distinct species ofMarburg (Peterson and Holder 2012) which would have much-reduced statistical power owing to low sample sizes

Random background points were generated from the accessiblearea for each species in numbers equal to numbers of real occur-rence data available for each species with 100 replicate samplesand with the least training presence thresholding option specifiedThese points were used to generate niche models in Maxent andbackground models were compared with the real-species modelin each replicate two such comparisons were developed testingeach of the species against the background of the other We calcu-lated Hellingerrsquos I indices in R (R Development Core Team 2013)for all replicates We used the 5th percentile of each distribution ofbackground similarity values as a critical value in rejecting the nullhypothesis of similarity between the two species

24 Pseudoreplication

In the recently published analyses of Ebola potential distribu-tions (Pigott et al 2014) the authors chose to interpret the manycases of Ebola detected in great apes in Gabon and Republic ofthe Congo (Sleeman 2004) as independent transmission events Toassess the effects of this assumption we compared Table 2 of Pigottet al (2014) to the occurrence data matrix cited above we identi-fied single occurrences in the latter that corresponded to multipleoccurrences in the formermdashindeed among other instances of pseu-doreplication in one case 21 occurrences in Table 2 of Pigott et al(2014) related to a single occurrence in our dataset

Hence we developed models for Zaire ebolavirus with the singlerepresentative (no pseudoreplication) and with multiple represen-tatives (pseudoreplication) To mimic the multiple sites cited forthese pseudoreplicates in Table 2 of Pigott et al (2014) instead ofusing a single coordinate pair we used random points cast withinthe uncertainty radius which covered a particularly broad areaWe then compared the resulting two maps detecting areas over-and under-emphasized by models including pseudoreplication Weidentified areas for which the pseudoreplicated model presentedpixel values outside the range of values among 25 replicate modelsdeveloped without pseudoreplication (see next paragraph) thesesites had pseudoreplication effects on model outputs that wereparticularly pronounced

25 Assess uncertainty

In the case of viral diseases as rare as Ebola and Marburg it is notsufficient simply to estimate suitability rather a crucial elementis assessment of uncertainty in model predictions to the extentpossible Hence to incorporate uncertainty deriving from posi-tional considerations into our models we created 25 sets of randompoints cast one per outbreak within the uncertainty radius of eachoutbreak (see Fig 1) following our previous examples (Nakazawaet al 2010 Peterson et al 2006) We then developed 25 repli-cate analyses following protocols listed above each consisting of 10bootstrap subsampling analyses From the 25 medians of the boot-strap replicates we obtained the maximum and minimum valuesand calculated range = maximum minus minimum as an index by whichto characterize uncertainty Because the range of values generatedin this index will be highly dependent on sample sizes we use thisindex principally for comparisons among areas within a particularspecies and not for comparisons among species

26 Assess possibility of ldquosurprisesrdquo

The known history of Ebola virus transmission events hasalready seen three events in which the viruses appearedunexpectedly Zaire ebolavirus and Sudan ebolavirus appearingsimultaneously in 1976 Reston ebolavirus apparently coming fromthe Philippines and Zaire ebolavirus appearing in Guinea in 2014In each case a virus appeared at a place where it was not expectedAs a consequence an important step is to assess the possibility offurther such distributional novelties

We maintained our focus on model calibration within the acces-sible area M but then transferred model predictions across all ofAfrica which is the appropriate means by which to extend nichemodels across areas of potential (but not realized) distribution out-side of our current concept of the accessible area of each species(Owens et al 2013) That is for this analysis we conducted a princi-pal components analysis across the entire continent but extractedthe portion of Africa within each speciesrsquo individual M area formodel calibration Although these principal components were notas specific the procedure has the advantage of putting the envi-ronmental data for within M and across Africa on the same scaleModels were calibrated within M and then transferred to all ofAfrica We assessed the degree to which model transfers would beextrapolative using the MOP metric (Owens et al 2013)

3 Results

We first assessed ecological niche similarity between Zaireebolavirus and Sudan ebolavirus The two null (background simi-larity) distributions (ie Zaire ebolavirus compared to backgroundmodels for Sudan ebolavirus and vice versa) had ranges of similar-ity values of 076ndash092 and 084ndash098 whereas the observed valuewas 048 As a consequence we rejected the null hypothesis of nichesimilarity in favor of an alternative hypothesis of niche differenceAn important implication of this result is that combining two Ebolaspecies with differentiated niches in analyses is not justified ana-lyzing them together would result in overestimating the breadth ofecological niches

Assessing effects of pseudoreplication of occurrence data onmodel outputs multiple ldquocopiesrdquo of occurrence data influencedmodeled distributional areas rather dramatically Specificallypseudoreplication caused underestimation of suitability of areasin southern Gabon southern Republic of the Congo and south-ern and southeastern Democratic Republic of the Congo (Fig 2)and overestimation of suitability in parts of the northern CongoBasin and southern Cameroon These differences caused by pseu-doreplicating occurrence data were lsquoout of rangersquo with regard to ourmodels incorporating positional uncertainty and thereby indicatesubstantial effects of pseudoreplication in terms of underestimat-ing suitability particularly in the southern and southeastern sectorsof the Congo Basin

Modeled suitability and associated uncertainty for each of the 5African filovirus species are summarized in Figs 3 and 4 Specif-ically Sudan ebolavirus was reconstructed as having a potentialdistribution focused in northwestern Uganda northeastern Demo-cratic Republic of the Congo and southern South Sudan Thesepredictions were particularly unstable however in the DemocraticRepublic of the Congo in contrast areas in Uganda were stablyidentified as presenting high suitability Zaire ebolavirus was mod-eled as having potential distributional areas across the known rangeof the species albeit extending more broadly in Cameroon IvoryCoast and other countries predictions in the lower (western) por-tions of the Congo Basin such as in Gabon Equatorial Guinea andsouthern Cameroon showed relatively high uncertainty

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 119

Fig 3 Modeled suitability for three relatively well-known species of filoviruses Sudan ebolavirus Zaire ebolavirus and Marburg Left-hand column maps show suitabilityvalues with darker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based ondifferent random representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a bluerectangle in an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 3: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

116 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 1 Summary of known occurrences of five filovirus species across Africa with associated uncertainty values These data correspond to the filovirus occurrence data setdescribed in the Methods and made openly available to the scientific community

that respond (at least roughly) to major axes of interest such aslatitude seasonality etc we believe that the high informationcontent of principal component axes combined with their orthogo-nality more than offsets the loss of interpretability associated withcomposite axes (Peterson 2014b) For the initial suite of ecologicalniche models we fit PCAs within M hypotheses for each speciesto be analyzed as M is the only region of importance to the actualdistribution of a species however for the final analysis assessingcontinentwide patterns of suitability and the potential for unex-pected emergence events of filovirus species (see below) we useda single PCA over all of Africa and used the M areas to clip out datasets for analysis so that the environmental data would be on thesame axes between the restricted and continentwide datasets

22 Model calibration

We initially evaluated a series of algorithms for use in this studyparticularly those facilitated by the openModeller platform (deSouza Munoz et al 2011) but we settled on Maxent version 333k(Phillips et al 2006) in light of the convenient bootstrapping fea-tures which allowed us to build in dimensions of uncertainty inour models We used the jackknife option to identify variables notcontributing importantly to model robustness (ie extrinsic gain)

Once a reduced set of variables was identified we conducted 10replicate analyses for each species based on a 50 bootstrap ofavailable occurrence data we used the median logistic output ofreplicate analyses as an appropriate summary although this met-ric has been interpreted in the form of a probability of occurrence(Phillips et al 2006) a more conservative interpretation is as ametric of suitability only (Peterson 2014b) which we follow hereAlthough we much appreciate the importance of thresholding eco-logical niche models to avoid overfitting and overinterpretation ofmodel outputs (Peterson et al 2007) in light of the small samplesizes available to us for all of the species of interest we generallyretained continuous model outputs for interpretation and visual-ization of suitability patterns

23 Assess niche similarity between species

We tested for departure from niche similarity using the back-ground similarity test developed by Warren et al (2008) whichassesses the similarity or difference of two niche estimates in rela-tion to similarity among large numbers of null replicate modelsIn effect the test assesses whether two niche estimates are moredifferent than one would expect given the lsquobackground similar-ity of the two accessible regions For this test we focused on the

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 117

Fig 2 Effects of pseudoreplication on model outputs for Zaire ebolavirus The middle map shows the difference between the logistic modeled suitability scores from modelsbased on pseudoreplicated data mimicking recent modeling efforts (Pigott et al 2014) the upper and lower panels show pixels (in black) in the western and eastern portionsof this speciesrsquo known distribution in which the pseudoreplicated model presented values out of the range observed in 25 replicate analyses of non-pseudoreplicated databased on random points cast across the uncertainty area for each point

118 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

two Ebola species for which reasonable numbers of occurrenceswere available (Zaire ebolavirus with N = 14 and Sudan ebolaviruswith N = 6) We did not develop such tests for Taiuml Forest ebolavirusand Bundibugyo ebolavirus or for the two likely-distinct species ofMarburg (Peterson and Holder 2012) which would have much-reduced statistical power owing to low sample sizes

Random background points were generated from the accessiblearea for each species in numbers equal to numbers of real occur-rence data available for each species with 100 replicate samplesand with the least training presence thresholding option specifiedThese points were used to generate niche models in Maxent andbackground models were compared with the real-species modelin each replicate two such comparisons were developed testingeach of the species against the background of the other We calcu-lated Hellingerrsquos I indices in R (R Development Core Team 2013)for all replicates We used the 5th percentile of each distribution ofbackground similarity values as a critical value in rejecting the nullhypothesis of similarity between the two species

24 Pseudoreplication

In the recently published analyses of Ebola potential distribu-tions (Pigott et al 2014) the authors chose to interpret the manycases of Ebola detected in great apes in Gabon and Republic ofthe Congo (Sleeman 2004) as independent transmission events Toassess the effects of this assumption we compared Table 2 of Pigottet al (2014) to the occurrence data matrix cited above we identi-fied single occurrences in the latter that corresponded to multipleoccurrences in the formermdashindeed among other instances of pseu-doreplication in one case 21 occurrences in Table 2 of Pigott et al(2014) related to a single occurrence in our dataset

Hence we developed models for Zaire ebolavirus with the singlerepresentative (no pseudoreplication) and with multiple represen-tatives (pseudoreplication) To mimic the multiple sites cited forthese pseudoreplicates in Table 2 of Pigott et al (2014) instead ofusing a single coordinate pair we used random points cast withinthe uncertainty radius which covered a particularly broad areaWe then compared the resulting two maps detecting areas over-and under-emphasized by models including pseudoreplication Weidentified areas for which the pseudoreplicated model presentedpixel values outside the range of values among 25 replicate modelsdeveloped without pseudoreplication (see next paragraph) thesesites had pseudoreplication effects on model outputs that wereparticularly pronounced

25 Assess uncertainty

In the case of viral diseases as rare as Ebola and Marburg it is notsufficient simply to estimate suitability rather a crucial elementis assessment of uncertainty in model predictions to the extentpossible Hence to incorporate uncertainty deriving from posi-tional considerations into our models we created 25 sets of randompoints cast one per outbreak within the uncertainty radius of eachoutbreak (see Fig 1) following our previous examples (Nakazawaet al 2010 Peterson et al 2006) We then developed 25 repli-cate analyses following protocols listed above each consisting of 10bootstrap subsampling analyses From the 25 medians of the boot-strap replicates we obtained the maximum and minimum valuesand calculated range = maximum minus minimum as an index by whichto characterize uncertainty Because the range of values generatedin this index will be highly dependent on sample sizes we use thisindex principally for comparisons among areas within a particularspecies and not for comparisons among species

26 Assess possibility of ldquosurprisesrdquo

The known history of Ebola virus transmission events hasalready seen three events in which the viruses appearedunexpectedly Zaire ebolavirus and Sudan ebolavirus appearingsimultaneously in 1976 Reston ebolavirus apparently coming fromthe Philippines and Zaire ebolavirus appearing in Guinea in 2014In each case a virus appeared at a place where it was not expectedAs a consequence an important step is to assess the possibility offurther such distributional novelties

We maintained our focus on model calibration within the acces-sible area M but then transferred model predictions across all ofAfrica which is the appropriate means by which to extend nichemodels across areas of potential (but not realized) distribution out-side of our current concept of the accessible area of each species(Owens et al 2013) That is for this analysis we conducted a princi-pal components analysis across the entire continent but extractedthe portion of Africa within each speciesrsquo individual M area formodel calibration Although these principal components were notas specific the procedure has the advantage of putting the envi-ronmental data for within M and across Africa on the same scaleModels were calibrated within M and then transferred to all ofAfrica We assessed the degree to which model transfers would beextrapolative using the MOP metric (Owens et al 2013)

3 Results

We first assessed ecological niche similarity between Zaireebolavirus and Sudan ebolavirus The two null (background simi-larity) distributions (ie Zaire ebolavirus compared to backgroundmodels for Sudan ebolavirus and vice versa) had ranges of similar-ity values of 076ndash092 and 084ndash098 whereas the observed valuewas 048 As a consequence we rejected the null hypothesis of nichesimilarity in favor of an alternative hypothesis of niche differenceAn important implication of this result is that combining two Ebolaspecies with differentiated niches in analyses is not justified ana-lyzing them together would result in overestimating the breadth ofecological niches

Assessing effects of pseudoreplication of occurrence data onmodel outputs multiple ldquocopiesrdquo of occurrence data influencedmodeled distributional areas rather dramatically Specificallypseudoreplication caused underestimation of suitability of areasin southern Gabon southern Republic of the Congo and south-ern and southeastern Democratic Republic of the Congo (Fig 2)and overestimation of suitability in parts of the northern CongoBasin and southern Cameroon These differences caused by pseu-doreplicating occurrence data were lsquoout of rangersquo with regard to ourmodels incorporating positional uncertainty and thereby indicatesubstantial effects of pseudoreplication in terms of underestimat-ing suitability particularly in the southern and southeastern sectorsof the Congo Basin

Modeled suitability and associated uncertainty for each of the 5African filovirus species are summarized in Figs 3 and 4 Specif-ically Sudan ebolavirus was reconstructed as having a potentialdistribution focused in northwestern Uganda northeastern Demo-cratic Republic of the Congo and southern South Sudan Thesepredictions were particularly unstable however in the DemocraticRepublic of the Congo in contrast areas in Uganda were stablyidentified as presenting high suitability Zaire ebolavirus was mod-eled as having potential distributional areas across the known rangeof the species albeit extending more broadly in Cameroon IvoryCoast and other countries predictions in the lower (western) por-tions of the Congo Basin such as in Gabon Equatorial Guinea andsouthern Cameroon showed relatively high uncertainty

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 119

Fig 3 Modeled suitability for three relatively well-known species of filoviruses Sudan ebolavirus Zaire ebolavirus and Marburg Left-hand column maps show suitabilityvalues with darker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based ondifferent random representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a bluerectangle in an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 4: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 117

Fig 2 Effects of pseudoreplication on model outputs for Zaire ebolavirus The middle map shows the difference between the logistic modeled suitability scores from modelsbased on pseudoreplicated data mimicking recent modeling efforts (Pigott et al 2014) the upper and lower panels show pixels (in black) in the western and eastern portionsof this speciesrsquo known distribution in which the pseudoreplicated model presented values out of the range observed in 25 replicate analyses of non-pseudoreplicated databased on random points cast across the uncertainty area for each point

118 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

two Ebola species for which reasonable numbers of occurrenceswere available (Zaire ebolavirus with N = 14 and Sudan ebolaviruswith N = 6) We did not develop such tests for Taiuml Forest ebolavirusand Bundibugyo ebolavirus or for the two likely-distinct species ofMarburg (Peterson and Holder 2012) which would have much-reduced statistical power owing to low sample sizes

Random background points were generated from the accessiblearea for each species in numbers equal to numbers of real occur-rence data available for each species with 100 replicate samplesand with the least training presence thresholding option specifiedThese points were used to generate niche models in Maxent andbackground models were compared with the real-species modelin each replicate two such comparisons were developed testingeach of the species against the background of the other We calcu-lated Hellingerrsquos I indices in R (R Development Core Team 2013)for all replicates We used the 5th percentile of each distribution ofbackground similarity values as a critical value in rejecting the nullhypothesis of similarity between the two species

24 Pseudoreplication

In the recently published analyses of Ebola potential distribu-tions (Pigott et al 2014) the authors chose to interpret the manycases of Ebola detected in great apes in Gabon and Republic ofthe Congo (Sleeman 2004) as independent transmission events Toassess the effects of this assumption we compared Table 2 of Pigottet al (2014) to the occurrence data matrix cited above we identi-fied single occurrences in the latter that corresponded to multipleoccurrences in the formermdashindeed among other instances of pseu-doreplication in one case 21 occurrences in Table 2 of Pigott et al(2014) related to a single occurrence in our dataset

Hence we developed models for Zaire ebolavirus with the singlerepresentative (no pseudoreplication) and with multiple represen-tatives (pseudoreplication) To mimic the multiple sites cited forthese pseudoreplicates in Table 2 of Pigott et al (2014) instead ofusing a single coordinate pair we used random points cast withinthe uncertainty radius which covered a particularly broad areaWe then compared the resulting two maps detecting areas over-and under-emphasized by models including pseudoreplication Weidentified areas for which the pseudoreplicated model presentedpixel values outside the range of values among 25 replicate modelsdeveloped without pseudoreplication (see next paragraph) thesesites had pseudoreplication effects on model outputs that wereparticularly pronounced

25 Assess uncertainty

In the case of viral diseases as rare as Ebola and Marburg it is notsufficient simply to estimate suitability rather a crucial elementis assessment of uncertainty in model predictions to the extentpossible Hence to incorporate uncertainty deriving from posi-tional considerations into our models we created 25 sets of randompoints cast one per outbreak within the uncertainty radius of eachoutbreak (see Fig 1) following our previous examples (Nakazawaet al 2010 Peterson et al 2006) We then developed 25 repli-cate analyses following protocols listed above each consisting of 10bootstrap subsampling analyses From the 25 medians of the boot-strap replicates we obtained the maximum and minimum valuesand calculated range = maximum minus minimum as an index by whichto characterize uncertainty Because the range of values generatedin this index will be highly dependent on sample sizes we use thisindex principally for comparisons among areas within a particularspecies and not for comparisons among species

26 Assess possibility of ldquosurprisesrdquo

The known history of Ebola virus transmission events hasalready seen three events in which the viruses appearedunexpectedly Zaire ebolavirus and Sudan ebolavirus appearingsimultaneously in 1976 Reston ebolavirus apparently coming fromthe Philippines and Zaire ebolavirus appearing in Guinea in 2014In each case a virus appeared at a place where it was not expectedAs a consequence an important step is to assess the possibility offurther such distributional novelties

We maintained our focus on model calibration within the acces-sible area M but then transferred model predictions across all ofAfrica which is the appropriate means by which to extend nichemodels across areas of potential (but not realized) distribution out-side of our current concept of the accessible area of each species(Owens et al 2013) That is for this analysis we conducted a princi-pal components analysis across the entire continent but extractedthe portion of Africa within each speciesrsquo individual M area formodel calibration Although these principal components were notas specific the procedure has the advantage of putting the envi-ronmental data for within M and across Africa on the same scaleModels were calibrated within M and then transferred to all ofAfrica We assessed the degree to which model transfers would beextrapolative using the MOP metric (Owens et al 2013)

3 Results

We first assessed ecological niche similarity between Zaireebolavirus and Sudan ebolavirus The two null (background simi-larity) distributions (ie Zaire ebolavirus compared to backgroundmodels for Sudan ebolavirus and vice versa) had ranges of similar-ity values of 076ndash092 and 084ndash098 whereas the observed valuewas 048 As a consequence we rejected the null hypothesis of nichesimilarity in favor of an alternative hypothesis of niche differenceAn important implication of this result is that combining two Ebolaspecies with differentiated niches in analyses is not justified ana-lyzing them together would result in overestimating the breadth ofecological niches

Assessing effects of pseudoreplication of occurrence data onmodel outputs multiple ldquocopiesrdquo of occurrence data influencedmodeled distributional areas rather dramatically Specificallypseudoreplication caused underestimation of suitability of areasin southern Gabon southern Republic of the Congo and south-ern and southeastern Democratic Republic of the Congo (Fig 2)and overestimation of suitability in parts of the northern CongoBasin and southern Cameroon These differences caused by pseu-doreplicating occurrence data were lsquoout of rangersquo with regard to ourmodels incorporating positional uncertainty and thereby indicatesubstantial effects of pseudoreplication in terms of underestimat-ing suitability particularly in the southern and southeastern sectorsof the Congo Basin

Modeled suitability and associated uncertainty for each of the 5African filovirus species are summarized in Figs 3 and 4 Specif-ically Sudan ebolavirus was reconstructed as having a potentialdistribution focused in northwestern Uganda northeastern Demo-cratic Republic of the Congo and southern South Sudan Thesepredictions were particularly unstable however in the DemocraticRepublic of the Congo in contrast areas in Uganda were stablyidentified as presenting high suitability Zaire ebolavirus was mod-eled as having potential distributional areas across the known rangeof the species albeit extending more broadly in Cameroon IvoryCoast and other countries predictions in the lower (western) por-tions of the Congo Basin such as in Gabon Equatorial Guinea andsouthern Cameroon showed relatively high uncertainty

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 119

Fig 3 Modeled suitability for three relatively well-known species of filoviruses Sudan ebolavirus Zaire ebolavirus and Marburg Left-hand column maps show suitabilityvalues with darker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based ondifferent random representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a bluerectangle in an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 5: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

118 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

two Ebola species for which reasonable numbers of occurrenceswere available (Zaire ebolavirus with N = 14 and Sudan ebolaviruswith N = 6) We did not develop such tests for Taiuml Forest ebolavirusand Bundibugyo ebolavirus or for the two likely-distinct species ofMarburg (Peterson and Holder 2012) which would have much-reduced statistical power owing to low sample sizes

Random background points were generated from the accessiblearea for each species in numbers equal to numbers of real occur-rence data available for each species with 100 replicate samplesand with the least training presence thresholding option specifiedThese points were used to generate niche models in Maxent andbackground models were compared with the real-species modelin each replicate two such comparisons were developed testingeach of the species against the background of the other We calcu-lated Hellingerrsquos I indices in R (R Development Core Team 2013)for all replicates We used the 5th percentile of each distribution ofbackground similarity values as a critical value in rejecting the nullhypothesis of similarity between the two species

24 Pseudoreplication

In the recently published analyses of Ebola potential distribu-tions (Pigott et al 2014) the authors chose to interpret the manycases of Ebola detected in great apes in Gabon and Republic ofthe Congo (Sleeman 2004) as independent transmission events Toassess the effects of this assumption we compared Table 2 of Pigottet al (2014) to the occurrence data matrix cited above we identi-fied single occurrences in the latter that corresponded to multipleoccurrences in the formermdashindeed among other instances of pseu-doreplication in one case 21 occurrences in Table 2 of Pigott et al(2014) related to a single occurrence in our dataset

Hence we developed models for Zaire ebolavirus with the singlerepresentative (no pseudoreplication) and with multiple represen-tatives (pseudoreplication) To mimic the multiple sites cited forthese pseudoreplicates in Table 2 of Pigott et al (2014) instead ofusing a single coordinate pair we used random points cast withinthe uncertainty radius which covered a particularly broad areaWe then compared the resulting two maps detecting areas over-and under-emphasized by models including pseudoreplication Weidentified areas for which the pseudoreplicated model presentedpixel values outside the range of values among 25 replicate modelsdeveloped without pseudoreplication (see next paragraph) thesesites had pseudoreplication effects on model outputs that wereparticularly pronounced

25 Assess uncertainty

In the case of viral diseases as rare as Ebola and Marburg it is notsufficient simply to estimate suitability rather a crucial elementis assessment of uncertainty in model predictions to the extentpossible Hence to incorporate uncertainty deriving from posi-tional considerations into our models we created 25 sets of randompoints cast one per outbreak within the uncertainty radius of eachoutbreak (see Fig 1) following our previous examples (Nakazawaet al 2010 Peterson et al 2006) We then developed 25 repli-cate analyses following protocols listed above each consisting of 10bootstrap subsampling analyses From the 25 medians of the boot-strap replicates we obtained the maximum and minimum valuesand calculated range = maximum minus minimum as an index by whichto characterize uncertainty Because the range of values generatedin this index will be highly dependent on sample sizes we use thisindex principally for comparisons among areas within a particularspecies and not for comparisons among species

26 Assess possibility of ldquosurprisesrdquo

The known history of Ebola virus transmission events hasalready seen three events in which the viruses appearedunexpectedly Zaire ebolavirus and Sudan ebolavirus appearingsimultaneously in 1976 Reston ebolavirus apparently coming fromthe Philippines and Zaire ebolavirus appearing in Guinea in 2014In each case a virus appeared at a place where it was not expectedAs a consequence an important step is to assess the possibility offurther such distributional novelties

We maintained our focus on model calibration within the acces-sible area M but then transferred model predictions across all ofAfrica which is the appropriate means by which to extend nichemodels across areas of potential (but not realized) distribution out-side of our current concept of the accessible area of each species(Owens et al 2013) That is for this analysis we conducted a princi-pal components analysis across the entire continent but extractedthe portion of Africa within each speciesrsquo individual M area formodel calibration Although these principal components were notas specific the procedure has the advantage of putting the envi-ronmental data for within M and across Africa on the same scaleModels were calibrated within M and then transferred to all ofAfrica We assessed the degree to which model transfers would beextrapolative using the MOP metric (Owens et al 2013)

3 Results

We first assessed ecological niche similarity between Zaireebolavirus and Sudan ebolavirus The two null (background simi-larity) distributions (ie Zaire ebolavirus compared to backgroundmodels for Sudan ebolavirus and vice versa) had ranges of similar-ity values of 076ndash092 and 084ndash098 whereas the observed valuewas 048 As a consequence we rejected the null hypothesis of nichesimilarity in favor of an alternative hypothesis of niche differenceAn important implication of this result is that combining two Ebolaspecies with differentiated niches in analyses is not justified ana-lyzing them together would result in overestimating the breadth ofecological niches

Assessing effects of pseudoreplication of occurrence data onmodel outputs multiple ldquocopiesrdquo of occurrence data influencedmodeled distributional areas rather dramatically Specificallypseudoreplication caused underestimation of suitability of areasin southern Gabon southern Republic of the Congo and south-ern and southeastern Democratic Republic of the Congo (Fig 2)and overestimation of suitability in parts of the northern CongoBasin and southern Cameroon These differences caused by pseu-doreplicating occurrence data were lsquoout of rangersquo with regard to ourmodels incorporating positional uncertainty and thereby indicatesubstantial effects of pseudoreplication in terms of underestimat-ing suitability particularly in the southern and southeastern sectorsof the Congo Basin

Modeled suitability and associated uncertainty for each of the 5African filovirus species are summarized in Figs 3 and 4 Specif-ically Sudan ebolavirus was reconstructed as having a potentialdistribution focused in northwestern Uganda northeastern Demo-cratic Republic of the Congo and southern South Sudan Thesepredictions were particularly unstable however in the DemocraticRepublic of the Congo in contrast areas in Uganda were stablyidentified as presenting high suitability Zaire ebolavirus was mod-eled as having potential distributional areas across the known rangeof the species albeit extending more broadly in Cameroon IvoryCoast and other countries predictions in the lower (western) por-tions of the Congo Basin such as in Gabon Equatorial Guinea andsouthern Cameroon showed relatively high uncertainty

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 119

Fig 3 Modeled suitability for three relatively well-known species of filoviruses Sudan ebolavirus Zaire ebolavirus and Marburg Left-hand column maps show suitabilityvalues with darker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based ondifferent random representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a bluerectangle in an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 6: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 119

Fig 3 Modeled suitability for three relatively well-known species of filoviruses Sudan ebolavirus Zaire ebolavirus and Marburg Left-hand column maps show suitabilityvalues with darker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based ondifferent random representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a bluerectangle in an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 7: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

120 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Fig 4 Modeled suitability for two poorly-known species of filoviruses Taiuml Forest ebolavirus and Bundibugyo ebolavirus Left-hand column maps show suitability values withdarker orange indicating more suitable conditions right-hand column maps show uncertainty in terms of range of median values when models were based on differentrandom representatives of occurrences within specific uncertainty radii (blue = low uncertainty red = high uncertainty) Location of the maps is indicated via a blue rectanglein an index map for each species (For interpretation of the references to colour in this figure legend the reader is referred to the web version of this article)

Modeled potential distributional areas for Marburg were rathermore scattered and diffuse across the accessible area for thisspecies Our models had some level of difficulty in distinguishingbetween the more seasonal and scrubby habitats of East Africa andthe less seasonal habitats of the Congo Basin however uncertaintywas focused in those latter regions of the Congo Basin such that thestrength of the prediction of suitability in those regions was unclearModeled potential distributional areas for Taiuml Forest ebolavirus andBundibugyo ebolavirus were both diffuse across their respectiveaccessible areas and all modeled suitable areas were also highlyuncertain such that the distributions of these two species remainpoorly characterized Relationships between uncertainty and suit-ability were curvilinear and diverse in form in different species(Fig 5) but clearly reflected effects of low sample sizes of knownoccurrences of some species (see in particular Taiuml Forest ebolavirusin Fig 5)

Assessing the possibility of unexpected distributional poten-tial as regards Ebola and Marburg virus distributions some areasemerge as potential distributional areas not within the knowndistributions of the three species for which sample sizes were suf-ficient to permit model development (Fig 6) Specifically for Zaire

ebolavirus two areas could be discerned southern Ethiopia and theGulf of Guinea rim of West Africa For Sudan ebolavirus and Marburgareas in southwestern Ethiopia and East Africa more generally werenotable in terms of distributional potential outside of known areas

4 Discussion

This study updates the current understanding of filovirus geog-raphy in Africa In the 10 years since our original analysis thenumber of known outbreaks has approximately doubled butmost have occurred under quite predictable circumstances how-ever with augmented input data as well as improved analyticalframeworks new maps have been developed that should bothanticipate future outbreak events and highlight areas of uncer-tain predictions Our careful consideration of uncertainty and howits effects percolate through the analysis process however indi-cated that reasonably robust models could be developed onlyfor Zaire ebolavirus Sudan ebolavirus and Marburg for Taiuml Forestebolavirus and Bundibugyo ebolavirus occurrence data are simplyinsufficient to permit rigorous model development at this point

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 8: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 121

Fig 5 Plots of model uncertainty calculated as range of values observed across 25 replicate analyses based on different sets of random points from across the uncertaintyareas of each point versus modeled suitability Key areas would be those showing high suitability and low uncertainty in the lower-right quadrant of this figure

Fig 6 Analysis of potential for occurrence of filovirus species outside of their known ranges and assumed accessible areas (inset maps show results of the MOP analyses inwhich blue areas are safe for interpretation but red areas are extrapolative and should not be interpreted)

For Reston ebolavirus the virus is known only from non-human pri-mate infections deriving from captive primate colonies for whichno geographic location of transmission from natural reservoir isknown as such the geographic potential of this species remainsentirely obscure

The four previous analyses of filovirus potential distributionalareas (Peterson et al 2004 2006 Pigott et al 2014 2015) were alllacking in terms of current needs for accurate mapping The olderpair of analyses (Peterson et al 2004 2006) developed by Petersonwere based on input occurrence data that included only about halfof the records now available such that those analyses are now quitedated They were also based on rather dated analytical workflowsthat did not take advantage of more recent advances in method-

ology (eg Barve et al 2011) Use of GARP (Stockwell and Peters1999) in the older studies was an optimal choice at the time but wewere not satisfied by clustered patterns of omission error in initialexperimental models developed in GARP for the present study

41 Filovirus mapping efforts

The more recent analyses (Pigott et al 2014 2015) were com-promised by a series of methodological decisions These studiesopted for use of recent (2000ndash2012) satellite imagery as environ-mental data which sums all land use change since 1976 into arather dramatic discord between occurrence data and environmen-tal data our approach minimized this disagreement by choosing

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 9: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

122 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

environmental data that coincided with the approximate midpointof the dates of the occurrence data As such yet a further map-ping effort taking best advantage of many lessons learned aboutecological niche modeling (Peterson 2014b) was in order

In these recent analyses (Pigott et al 2014 2015) no consider-ation was paid to accessible areas (ie the specific areas that havebeen available and accessible to species over relevant time peri-ods) in model calibration (Barve et al 2011) In addition all Ebolaspecies were lumped together for analysis notwithstanding possi-ble niche divergence among them (Warren et al 2008) which wasconfirmed in our analyses at least for the two best-known speciesthis lsquolumpingrsquo of differentiated taxa will generally result in overly-broad estimates of ecological niches No assessment was made ofuncertainty in the predictions that were produced such that thereader is left only with a prediction and no idea as to confidence inpredictions for any particular species or place

Finally and perhaps most importantly in the occurrence dataused to calibrate models in the Ebola-focused paper (Pigott et al2014) individual reservoir-to-nonreservoir mammal events werepseudoreplicated that is no care was taken to distinguish amongmultiple versions of the same emergence event in which the virusjumped to nonreservoir mammals but then may have spreadamong the nonreservoir animals (gorillas in particular) We suspectthat this approach springs from the authorsrsquo focus on the ldquozoonoticnicherdquo (Pigott et al 2014 2015) wherein non-human primateswould be considered as zoonotic intermediate hosts though notnecessarily representative of the ecology of the reservoir and trans-mission from the reservoir to humans Such errors of lack of carewith pseudoreplication and overrepresentation of areas in modelcalibration have been made in previous analyses (Fichet-Calvet andRogers 2009) yet affect the outcomes of niche modeling effortsrather dramatically (Peterson et al 2014)

42 Species accounts

421 Zaire ebolavirusThis virus is the best-known and best-documented of the

filoviruses with a known range across the humid rainforest beltof Africa Beyond its known points of occurrence our models sug-gested the potential for occurrence farther to the east in the DRC(moderate suitability with moderate uncertainty) and west intoGabon Equatorial Guinea and Cameroon (albeit with higher uncer-tainty) The West African distributional area of this virus did nothave a strong signal of high suitability with low uncertainty whichperhaps is a more general quality of this virus (Fig 5) Beyond itscurrent assumed distributional limits our model transfer exercisesuggested that southern Ethiopia may represent a potential dis-tributional area for this species although biogeographic barriersisolating Ethiopia from the rest of Africa are significant (Petersonand Martiacutenez-Meyer 2007)

422 Sudan ebolavirusThis virus has a restricted distribution yet its ecological niche

signature is the clearest of the viruses analyzed in this study Ascan be seen in Fig 5 the areas predicted as most suitable for thisspecies were identified as such with low uncertainty That is areasof northwestern Uganda particularly between Lake Albert and LakeVictoria were identified as highly suitable with low uncertaintynortheastern DRC held areas of modeled high suitability but withhigher uncertainty Model transfer exercises suggested the possi-bility of suitable areas for this species in Gabon Republic of theCongo and in West Africa although such far-removed areas are ofunknown significance

423 MarburgMarburg represents a relatively enigmatic filovirus in terms of

its diversity ecology and geography That is early Marburg detec-tions were concentrated in East Africa but one early outbreak wasfrom considerably farther south in Zimbabwe (Conrad et al 1978)Peterson et al (2006) found that the Zimbabwe outbreak matchedthe niche lsquosignaturersquo of the East African outbreaks and success-fully anticipated the potential for Marburg to emerge elsewhere insouthern Africa including an Angola outbreak (Towner et al 2006)Our new models identified broad suitable areas across the north-eastern DRC central Uganda central Kenya northern Angola andsouthwesternmost DRC many areas of high suitability were iden-tified as such with low uncertainty (Fig 5) such that these modelpredictions appear to be useful In broader projections of the Mar-burg niche model broader areas of northeastern DRC southernTanzania and Mozambique were also identified as matching theenvironmental profile of this virus

However Marburg may present more complexity than iscurrently appreciated two major lineages exist within thecurrently-recognized species (Johnson et al 1996) which occursympatrically at least in East Africa Although the two lineages areclearly on separate evolutionary trajectories (Peterson and Holder2012) current taxonomic approaches for viruses obfuscate thisvery-real diversity (Peterson 2014a) As such the true diversityof Marburg viruses and implications for host associations ecol-ogy and geographic range must await availability of additionalinformation

424 Bundibugyo ebolavirusThis virus was described only recently as a new filovirus (Towner

et al 2008) In spite of minimal occurrence information model out-puts identified areas of the northeastern DRC southern Uganda andsouthwestern Kenya as potentially suitable for this species How-ever uncertainty values were high in essentially all of the areasidentified as suitable (Fig 5) such that no clear prediction of suit-ability was available

425 Taiuml Forest ebolavirusTaiuml Forest ebolavirus certainly ranks among the least-known

filoviruses with Bundibugyo ebolavirus (Towner et al 2008) anda poorly-known filovirus in Europe (Negredo et al 2011) thisknowledge vacuum was reflected in our modeling outputs aswe found no clear signal of suitability likely a consequence ofextremely thin knowledge In brief Taiuml Forest ebolavirus is knownbest from a single well-documented case in which a veterinar-ian performing an autopsy on a dead chimpanzee became infected(Formenty et al 1999 Le Guenno et al 1995) Among the manycompilations of filovirus occurrences in recent publications (Bauschand Schwarz 2014 Changula et al 2014 Chippaux 2014 Leroyet al 2009 Mylne et al 2014 Pourrut et al 2005 Roddy 2014)however only Chippaux (2014) mentions a second detection ofwhat was apparently this species which was included in our ear-lier modeling efforts (Peterson et al 2004) Indeed Mylne et al(2014) went so far as to state explicitly that only a single case of thisvirus was known The case in question was of a man who crossedfrom Liberia into Ivory Coast as a refugee and presented symptomsof viral hemorrhagic fever (WHO 1995ab) he was diagnosedvia serological assays at the Pasteur Institute and appears to haveinfected no further individuals

This 1995 case is of interest for a number of reasons First of allit apparently doubles the information available about the spatialdistribution of one of the least-known filoviruses and for that rea-son alone merits exploration Second however and perhaps moreinteresting still is that this 1995 case was apparently diagnosedon serological grounds only (WHO 1995ab) and apparently hasnever been sequenced As no detail is provided regarding the sero-

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 10: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124 123

logical techniques used in the diagnosis we mention the possibilitythat this case could thus pertain to either Taiuml Forest ebolavirus orZaire ebolavirus Perhaps no samples have been retained from thisparticular case but it certainly is an intriguing historical point thatshould not be lost from the view of the field

43 Caveats

The analyses that we have presented herein do come witha number of caveats of course The time-averaged nature ofour niche modeling analyses represents a rather-general limita-tion of the correlational niche modeling paradigm as it presentlystandsmdashpreliminary explorations of the possibility of linking envi-ronmental data that are specific to the timing of each individualoccurrence datum have been promising (Peterson et al 2005)However the methodology has not seen adequate testing as of yetparticularly for species with small sample sizes of occurrence datasuch as the species analyzed herein

The non-analogous environments across some of the broaderprojections complicate our attempts to assess the potential for sur-prise outbreaks in the future That is our hypotheses of accessibleareas were relatively narrow which affects the generality and reli-ability of model transfers to other regions (Owens et al 2013) Nosolution is available to these complications of extrapolation so wesimply attempt to interpret model transfers under extrapolativeconditions with care

44 Conclusions

The geography of the African filoviruses presents several fas-cinating features on which we comment here Until recently theEbola virus complex appeared to be distributed in a mosaic acrossAfrica in which no pair of species co-occurred or approached oneanother this picture has changed rather drastically in recent yearsParticularly intriguing is the concentration of species diversity inEast Africamdashin a small area of southern South Sudan Uganda north-eastern Democratic Republic of the Congo and western Kenyathree filovirus species have been documented Indeed if recentsuggestions about lineage independence and species limits withinMarburg (Peterson and Holder 2012) were heeded yet a fourthspecies would be recognized from the region In contrast the CongoBasin appears to house but a single species in spite of many morehuman and ape infections known and occurrences of uncertainsignificance in bats (Towner et al 2007) particularly in terms ofstudies based on serological detections

Although the great bulk of Ebola and Marburg cases that haveaccumulated in the past decade (ie since the initial modelingefforts a decade ago) is centered within the bounds of the then-known and modeled-suitable areas the 2014 Guinea outbreakreminds scientists that current knowledge is quite incomplete andthat the viruses may emerge in novel areas Our analysis of the pos-sibility of such events suggests that a clear region of concern couldbe the highlands of southern Ethiopia (both Ebola and Marburg) andregions of southern Africa such as southern Tanzania MozambiqueZambia and parts of South Africa (Marburg only Fig 6) Althoughappearance of filoviruses in these areas would indeed be surprisinginvestment of resources in education to avoid medical and publichealth personnel being caught at unawares is probably a good idea

Acknowledgements

We thank Lindsay Campbell for help with analyses and com-ments on an early draft of the manuscript and the EgyptianFulbright Mission Program (EFMP) for support of AMS during devel-opment of this contribution

References

Amman BR Carroll SA Reed ZD Sealy TK Balinandi S Swanepoel R KempA Erickson BR Comer JA Campbell S 2012 Seasonal pulses of Marburgvirus circulation in juvenile Rousettus aegyptiacus bats coincide with periods ofincreased risk of human infection PLoS Pathog 8 e1002877

Amman BR Jones ME Sealy TK Uebelhoer LS Schuh AJ Bird BHColeman-McCray JD Martin BE Nichol ST Towner JS 2015 Oralshedding of Marburg virus in experimentally infected Egyptian fruit bats(Rousettus aegyptiacus) J Wildl Dis 51

Barve N Barve V Jimeacutenez-Valverde A Lira-Noriega A Maher SP PetersonAT Soberoacuten J Villalobos F 2011 The crucial role of the accessible area inecological niche modeling and species distribution modeling Ecol Model 2221810ndash1819

Bausch DG Schwarz L 2014 Outbreak of Ebola virus disease in Guinea whereecology meets economy PLoS Negl Trop Dis 8 e3056

Changula K Kajihara M Mweene AS Takada A 2014 Ebola and Marburg virusdiseases in Africa increased risk of outbreaks in previously unaffected areasMicrobiol Immunol 58 483ndash491

Chippaux J-P 2014 Outbreaks of Ebola virus disease in Africa the beginnings of atragic saga J Venom Anim Toxins Incl Trop Dis 20 44

Conrad JL Isaacson M Smith EB Wulff H Crees M Geldenhuys P JohnstonJ 1978 Epidemiologic investigation of Marburg virus disease southern Africa1975 Am J Trop Med Hyg 27 1210ndash1215

R Development Core Team 2013 R A Language and Environment for StatisticalComputing httpwwwR-projectorg Vienna

Fichet-Calvet E Rogers DJ 2009 Risk maps of Lassa fever in west Africa PLoSNegl Trop Dis 3 e388

Formenty P Boesch C Wyers M Steiner C Donati F Dind F Walker F LeGuenno B 1999 Ebola virus outbreak among wild chimpanzees living in arain forest of Cote drsquoIvoire J Infect Dis 179 S120ndashS126

Groseth A Feldmann H Strong JE 2007 The ecology of Ebola virus TrendsMicrobiol 15 408ndash416

Hayman DT Yu M Crameri G Wang L-F Suu-Ire R Wood JLNCunningham AA 2012 Ebola virus antibodies in fruit bats Ghana WestAfrica Emerg Infect Dis 18 1207ndash1209

James M Kalluri S 1994 The Pathfinder AVHRR land data set an improvedcoarse resolution data set for terrestrial monitoring Int J Remote Sens 153347ndash3363

Johnson ED Johnson BK Silverstein D Tukei P Geisbert TW Sanchez ANJahrling PB 1996 Characterization of a new Marburg virus isolated from a1987 fatal case in Kenya Arch Virol 101ndash114

Le Guenno B Formentry P Wyers M Guonon P Walker F Boesch C 1995Isolation and partial characterisation of a new strain of ebola virus Lancet 3451271ndash1274

Leroy EM Epelboin A Mondonge V Pourrut X Gonzalez J-PMuyembe-Tamfum J-J Formenty P 2009 Human Ebola outbreak resultingfrom direct exposure to fruit bats in Luebo Democratic Republic of Congo2007 Vector-borne Zoonotic Dis 9 723ndash728

Mylne A Brady OJ Huang Z Pigott DM Golding N Kraemer MU Hay SI2014 A comprehensive database of the geographic spread of past human Ebolaoutbreaks Sci Data 1 140042

Nakazawa Y Williams R Peterson AT Mead P Kugeler K Petersen J 2010Ecological niche modeling of Francisella tularensis subspecies and clades in theUnited States Am J Trop Med Hyg 82 912ndash918

Negredo A Palacios G Vaacutezquez-Moroacuten S Gonzaacutelez F Dopazo H Molero FJuste J Quetglas J Savji N de la Cruz Martiacutenez M 2011 Discovery of anEbolavirus-like filovirus in Europe PLoS Pathog 7 e1002304

Owens HL Campbell LP Dornak L Saupe EE Barve N Soberoacuten J IngenloffK Lira-Noriega A Hensz CM Myers CE Peterson AT 2013 Constraintson interpretation of ecological niche models by limited environmental rangeson calibration areas Ecol Model 263 10ndash18

Peterson AT Holder MT 2012 Phylogenetic assessment of filoviruses howmany lineages of Marburgvirus Ecol Evol 2 1826ndash1833

Peterson AT Martiacutenez-Meyer E 2007 Geographic evaluation of conservationstatus of African forest squirrels (Sciuridae) considering land use change andclimate change the importance of point data Biodivers Conserv 163939ndash3950

Peterson AT Bauer JT Mills JN 2004 Ecological and geographic distribution offilovirus disease Emerg Infect Dis 10 40ndash47

Peterson AT Martiacutenez-Campos C Nakazawa Y Martiacutenez-Meyer E 2005Time-specific ecological niche modeling predicts spatial dynamics of vectorinsects and human dengue cases Trans R Soc Trop Med Hyg 99 647ndash655

Peterson AT Lash RR Carroll DS Johnson KM 2006 Geographic potential foroutbreaks of Marburg hemorrhagic fever Am J Trop Med Hyg 75 9ndash15

Peterson AT Papes M Eaton M 2007 Transferability and model evaluation inecological niche modeling a comparison of GARP and Maxent Ecography 30550ndash560

Peterson AT Soberoacuten J Pearson RG Anderson RP Martiacutenez-Meyer ENakamura M Arauacutejo MB 2011 Ecological Niches and GeographicDistributions Princeton University Press Princeton

Peterson AT Moses LM Bausch DG 2014 Mapping transmission risk of Lassafever in West Africa the importance of quality control sampling bias anderror weighting PLoS One 9 e100711

Peterson AT 2014a Defining viral species making taxonomy useful Virol J 11131

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135

Page 11: Geographic potential of disease caused by Ebola and ...arntd.org/wp-content/uploads/2017/07/Geographic... · documented of hemorrhagic fever caused by viruses of the family Filoviridae,

124 AT Peterson AM Samy Acta Tropica 162 (2016) 114ndash124

Peterson AT 2014b Mapping Disease Transmission Risk in Geographic andEcological Contexts Johns Hopkins University Press Baltimore

Phillips SJ Anderson RP Schapire RE 2006 Maximum entropy modeling ofspecies geographic distributions Ecol Model 190 231ndash259

Pigott DM Golding N Mylne A Huang Z Henry AJ Weiss DJ Brady OJKraemer MUG Smith DL Moyes CL 2014 Mapping the zoonotic niche ofEbola virus disease in Africa Elife 3 e04395

Pigott DM Golding N Mylne A Huang Z Weiss DJ Brady OJ Kraemer MUHay SI 2015 Mapping the zoonotic niche of Marburg virus disease in AfricaTrans R Soc Trop Med Hyg 109 366ndash378

Polonsky JA Wamala JF de Clerck H Van Herp M Sprecher A Porten KShoemaker T 2014 Emerging filoviral disease in Uganda proposedexplanations and research directions Am J Trop Med Hyg 90 790ndash793

Pourrut X Kumulungui B Wittmann T Moussavou G Deacutelicat A Yaba PNkoghe D Gonzalez J-P Leroy EM 2005 The natural history of Ebola virusin Africa Microbes Infect 7 1005ndash1014

Pourrut X Souris M Towner JS Rollin PE Nichol ST Gonzalez J-P Leroy E2009 Large serological survey showing cocirculation of Ebola and Marburgviruses in Gabonese bat populations and a high seroprevalence of both virusesin Rousettus aegyptiacus BMC Infect Dis 9 159

Roddy P 2014 A call to action to enhance filovirus disease outbreak preparednessand response Viruses 6 3699ndash3718

Sleeman JM 2004 The role of Ebola virus in the decline of great ape populationsOryx 38 136

Stockwell DRB Peters DP 1999 The GARP modelling system problems andsolutions to automated spatial prediction Int J Geogr Inf Sci 13 143ndash158

Towner JS Khristova ML Sealy TK Vincent MJ Erickson BR Bawiec DAHartman AL Comer JA Zaki S Stroumlher U Gomes da Silva F del Castillo FRollin PE Ksiazek TG Nichol ST 2006 Marburgvirus genomics andassociation with a large hemorrhagic fever outbreak in Angola J Virol 806497ndash6516

Towner JS Pourrut X Albarino CG Nkogue CN Bird BH Grard G KsiazekTG Gonzalez J-P Nichol ST Leroy EM 2007 Marburg virus infectiondetected in a common African bat PLoS One 8 e764

Towner JS Sealy TK Khristova ML Albarino CG Conlan S Reeder SAQuan P-L Lipkin WI Downing R Tappero JW Okware S Lutwama JBakamutumaho B Kayiwa J Comer JA Rollin PE Ksiazek TG Nichol ST2008 Newly discovered Ebola virus associated with hemorrhagic feveroutbreak in Uganda PLoS Pathog 4 e1000212

Towner JS Amman BR Sealy TK Carroll SAR Comer JA Kemp ASwanepoel R Paddock CD Balinandi S Khristova ML 2009 Isolation ofgenetically diverse Marburg viruses from Egyptian fruit bats PLoS Pathog 5e1000536

UMD 2001 AVHRR NDVI Data Set University of Maryland College Park Marylandhttpglcfumiacsumdeduindexshtml

Veloz SD 2009 Spatially autocorrelated sampling falsely inflates measures ofaccuracy for presence-only niche models J Biogeogr 36 2290ndash2299

WHO 1995a Ebola haemorrhagic fever in Cocircte drsquoIvoire Wkly Epidemiol Rec 70367

WHO 1995b Ebola haemorrhagic fever confirmed case in Cocircte drsquoIvoire andsuspect cases in Liberia Wkly Epidemiol Rec 70 359

Warren DL Glor RE Turelli M 2008 Environmental niche equivalency versusconservatism quantitative approaches to niche evolution Evolution 622868ndash2883

de Oliveira G Rangel TF Lima-Ribeiro MS Terribile LC Diniz JAF 2014Evaluating partitioning and mapping the spatial autocorrelation componentin ecological niche modeling a new approach based on environmentallyequidistant records Ecography 37 637ndash647

de Souza Munoz M de Giovanni R de Siqueira M Sutton T Brewer P PereiraR Canhos D Canhos V 2011 openModeller a generic approach to speciesrsquopotential distribution modelling GeoInformatica 15 111ndash135