Absence of evidence for the conservation outcomes of systematic conservation planning around the globe: a systematic map

Background: Systematic conservation planning is a discipline concerned with the prioritisation of resources for biodiversity conservation and is often used in the design or assessment of terrestrial and marine protected area networks. Despite being an evidence-based discipline, to date there has been no comprehensive review of the outcomes of systematic conservation plans and assessments of the relative effectiveness of applications in different contexts. To address this fundamental gap in knowledge, our primary research question was: what is the extent, distribution and robustness of evidence on conservation outcomes of systematic conservation planning around the globe? Methods: A systematic mapping exercise was undertaken using standardised search terms across 29 sources, including publication databases, online repositories and a wide range of grey literature sources. The review team screened articles recursively, first by title only, then abstract and finally by full-text, using inclusion criteria related to systematic conservation plans conducted at sub-global scales and reported on since 1983. We sought studies that reported outcomes relating to natural, human, social, financial or institutional outcomes and which employed robust evaluation study designs. The following information was extracted from included studies: bibliographic details, background information including location of study and broad objectives of the plan, study design, reported outcomes and context. Results: Of the approximately 10,000 unique articles returned through our searches, 1209 were included for full-text screening and 43 studies reported outcomes of conservation planning interventions. However, only three studies involved the use of evaluation study designs which are suitably rigorous for inclusion, according to best-practice guidelines. The three included studies were undertaken in the Gulf of California (Mexico), Réunion Island, and The Nature Conservancy’s landholdings across the USA. The studies varied widely in context, purpose and outcomes. Study designs were non-experimental or qualitative, and involved use of spatial landholdings over time, stakeholder surveys and modelling of alternative planning scenarios. Conclusion: Rigorous evaluations of systematic conservation plans are currently not published in academic journals or made publicly available elsewhere. Despite frequent claims relating to positive implications and outcomes of these planning activities, we show that evaluations are probably rarely conducted. This finding does not imply systematic conservation planning is not effective but highlights a significant gap in our understanding of how, when and why it may or may not be effective. Our results also corroborate claims that the literature on systematic conservation planning is dominated by methodological studies, rather than those that focus on implementation and outcomes, and support the case that this is a problematic imbalance in the literature. We emphasise the need for academics and practitioners to publish the outcomes of systematic conservation planning exercises and to consider employing © The Author(s) 2018. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creat iveco mmons .org/licen ses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creat iveco mmons .org/ publi cdoma in/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Open Access Environmental Evidence *Correspondence: emma.mcintosh@ouce.ox.ac.uk 1 School of Geography and the Environment, University of Oxford, Oxford, UK Full list of author information is available at the end of the article Page 2 of 23 McIntosh et al. Environ Evid (2018) 7:22 Background Conservation planning is “the process of making informed conservation decisions” [1]. It is particularly important given the scale and complexity of policy and institutional agendas when it comes to spatially allocating resources. An approach to conservation planning that has received widespread interest for its evidence-based approach amongst academics [2, 3], conservation organisations [4, 5] and government departments [6–8] alike, is the discipline of systematic conservation planning [9]. Systematic conservation planning emerged in the 1980s and 1990s as a response to the tendency of conservation decisions to be made in an ad hoc manner. The majority of terrestrial protected areas were designated in places that were steeply sloped or otherwise unsuitable for agriculture, rather than where biodiversity was highest and most in need of protection [10, 11]. Systematic conservation planning built on ranking approaches, popular before the early 1980s, by incorporating quantifiable objectives and enabling assessments of trade-offs between competing conservation and cost-effectiveness considerations [12]. Ecological principles such as representation and persistence are central to systematic conservation planning [2]. Terminology has changed over time (for political expediency; Pressey, in prep) since the CAR (comprehensiveness, adequacy, representativeness) principles that first drove the discipline. Here, representation refers to the extent to which a set of reserves samples the full biodiversity of a region (combining both the original uses of comprehensiveness and representativeness). Persistence (originally framed in terms of adequacy) means the longterm survival of species or other elements of biodiversity, including diverse natural processes at a variety of scales [13], often approached through connectivity of multiple species and habitats across landscapes and seascapes. Systematic conservation planning proposes a structured, consultative process for choosing between, locating, configuring, and implementing conservation actions, often involving input from policy makers, land managers and resource users. Conservation objectives are specified quantitatively, in one of two ways [14]. First, and most commonly, objectives are expressed as threshold amounts of natural features relative to a baseline. An example is to cover at least 20% of each vegetation type, with no explicit added benefit for amounts over 20%. Second, objectives can be defined as continuous functions that accrue benefit up to 100% coverage of features. The outputs are optimal or near-optimal sets of spatiallybounded conservation actions [15, 16] (Fig. 1). During the planning process, spatial conservation prioritisations (also called conservation assessments) are conducted, often using decision support software such as Marxan [17], Marxan-with-Zones [18], Zonation [19], and C-Plan [20] (Fig. 1) to sort through the vast number of potential spatial configurations. While often equated with systematic conservation planning, prioritisations are analytical exercises making up only a subset of the overall process [16]. Rigorous evidence is central to systematic conservation planning. This generally includes spatial data for biodiversity or environmental surrogates [13] and socio-economic cost predictors such as land acquisition costs or willingness of landowners to support proposed conservation activities [21]. Plans often also include assessments of vulnerability to future climatic regimes [22] and analyses of scheduling conservation actions over time [23]. The discipline of systematic conservation planning has had a major influence on conservation planning practices globally. It is used extensively by environmental organisations and government agencies alike [1]. Thousands of academic publications focus on the discipline, a trend that appears to be increasing [2, 24]. Marxan alone had over 6700 users from 184 countries between 2011 and 2016 [25]. Efforts are currently underway to centralise records about where systematic conservation plans have been developed [26], as the number of total plans is currently unknown. This type of planning is resource intensive and can cost millions of dollars over several years [27]. Despite the influence of the discipline and the importance of evidence when developing plans, there is very little rigorous evidence about whether systematic conservation planning is effective at improving biodiversity conservation outcomes [28]. A compilation of such studies is therefore much needed. This study outlines the results of a systematic mapping exercise to comprehensively assess the published literature on the effectiveness of systematic conservation planning [15]. Systematic maps are typically precursors robust evaluation methodologies when reporting project outcomes. Adequate reporting of outcomes will in turn enable transparency and accountability between institutions and funding bodies as well as improving the science and practice of conservation planning.


Background
Conservation planning is "the process of making informed conservation decisions" [1]. It is particularly important given the scale and complexity of policy and institutional agendas when it comes to spatially allocating resources. An approach to conservation planning that has received widespread interest for its evidence-based approach amongst academics [2,3], conservation organisations [4,5] and government departments [6][7][8] alike, is the discipline of systematic conservation planning [9].
Systematic conservation planning emerged in the 1980s and 1990s as a response to the tendency of conservation decisions to be made in an ad hoc manner. The majority of terrestrial protected areas were designated in places that were steeply sloped or otherwise unsuitable for agriculture, rather than where biodiversity was highest and most in need of protection [10,11]. Systematic conservation planning built on ranking approaches, popular before the early 1980s, by incorporating quantifiable objectives and enabling assessments of trade-offs between competing conservation and cost-effectiveness considerations [12].
Ecological principles such as representation and persistence are central to systematic conservation planning [2]. Terminology has changed over time (for political expediency; Pressey, in prep) since the CAR (comprehensiveness, adequacy, representativeness) principles that first drove the discipline. Here, representation refers to the extent to which a set of reserves samples the full biodiversity of a region (combining both the original uses of comprehensiveness and representativeness). Persistence (originally framed in terms of adequacy) means the longterm survival of species or other elements of biodiversity, including diverse natural processes at a variety of scales [13], often approached through connectivity of multiple species and habitats across landscapes and seascapes.
Systematic conservation planning proposes a structured, consultative process for choosing between, locating, configuring, and implementing conservation actions, often involving input from policy makers, land managers and resource users. Conservation objectives are specified quantitatively, in one of two ways [14]. First, and most commonly, objectives are expressed as threshold amounts of natural features relative to a baseline. An example is to cover at least 20% of each vegetation type, with no explicit added benefit for amounts over 20%. Second, objectives can be defined as continuous functions that accrue benefit up to 100% coverage of features. The outputs are optimal or near-optimal sets of spatiallybounded conservation actions [15,16] (Fig. 1).
During the planning process, spatial conservation prioritisations (also called conservation assessments) are conducted, often using decision support software such as Marxan [17], Marxan-with-Zones [18], Zonation [19], and C-Plan [20] (Fig. 1) to sort through the vast number of potential spatial configurations. While often equated with systematic conservation planning, prioritisations are analytical exercises making up only a subset of the overall process [16].
Rigorous evidence is central to systematic conservation planning. This generally includes spatial data for biodiversity or environmental surrogates [13] and socio-economic cost predictors such as land acquisition costs or willingness of landowners to support proposed conservation activities [21]. Plans often also include assessments of vulnerability to future climatic regimes [22] and analyses of scheduling conservation actions over time [23].
The discipline of systematic conservation planning has had a major influence on conservation planning practices globally. It is used extensively by environmental organisations and government agencies alike [1]. Thousands of academic publications focus on the discipline, a trend that appears to be increasing [2,24]. Marxan alone had over 6700 users from 184 countries between 2011 and 2016 [25]. Efforts are currently underway to centralise records about where systematic conservation plans have been developed [26], as the number of total plans is currently unknown.
This type of planning is resource intensive and can cost millions of dollars over several years [27]. Despite the influence of the discipline and the importance of evidence when developing plans, there is very little rigorous evidence about whether systematic conservation planning is effective at improving biodiversity conservation outcomes [28]. A compilation of such studies is therefore much needed.
This study outlines the results of a systematic mapping exercise to comprehensively assess the published literature on the effectiveness of systematic conservation planning [15]. Systematic maps are typically precursors robust evaluation methodologies when reporting project outcomes. Adequate reporting of outcomes will in turn enable transparency and accountability between institutions and funding bodies as well as improving the science and practice of conservation planning. Keywords: Conservation assessment, Prioritisation, Resource allocation, Evidence synthesis, Protected areas, Implementation to systematic reviews, and involve collating, describing and assessing the quality of studies assessing a particular intervention [29]. To our knowledge, this is the first systematic map of a planning intervention in the environmental sciences, and it introduces a new set of challenges in evidence availability and interpretation. Interest in this topic is not new and this study follows a preliminary protocol for a systematic review lodged with the Collaboration for Environmental Evidence in 2008 [30].
Our conceptual understanding of how systematic conservation planning interventions can lead to outcomes in terms of natural, human, social, financial and institutional capital is illustrated in Fig. 2 and expanded in Table 1. This theory of change is deliberately simple, to illustrate the potentially broad range of outcomes currently assumed to result from systematic conservation planning exercises [28]. Multiple pathways and mechanisms are likely to link systematic conservation planning to these outcomes, but these are not yet consistently defined. Our decision to report outcomes by five types of capital follows an earlier application of this framework to conservation planning [31] and related disciplines, e.g. [32]. For further details, see our published protocol [15].

Stakeholder engagement
Subject experts (ranging from academic researchers to staff in environmental NGOs responsible for conducting systematic conservation plans) were consulted throughout the protocol development, searching and analysis stages. This occurred in several ways. Calls were put out for comment at relevant conferences and workshops, alongside presentations and posters about the project. Subject experts were also invited to share potentially relevant studies (see methods) and this usually led to email or phone discussions about the research and findings, as well as recommendations for additional contacts.

Objective of the systematic map
We sought to identify retrospective studies that measured the effects of systematic conservation planning exercises on biodiversity conservation at various scales. Our primary research question was: What is the extent and distribution of evidence on conservation outcomes of systematic conservation planning around the globe? The definitions used to focus our search are provided in Table 2.
Our intent was to categorise included studies using a data extraction framework. The framework was designed to explore the following secondary questions: • What are the characteristics of the current evidence base, including study location, scale and study design, intervention and outcome type? • What types of outcomes of systematic conservation planning exercises are measured, either by the original planning organisation(s) or others? • What types of study designs are used in evaluations of systematic conservation planning? • How robust is existing evidence? How many impact evaluations have been conducted, where and by whom?  Fig. 1 The stages of systematic conservation planning, reproduced from [15] and originally modified from [27]. Conservation planning approaches that are more systematic tend to follow these general steps. We define systematic conservation planning initiatives as those that also make use of spatial prioritisations and associated computational decision-support tools during stages 8 and 9 (boxed)

Methods
Our systematic map protocol has been published in Environmental Evidence [15] and this section includes updates since that publication. Updates include the use of the software CADIMA, consistency checks prior to every screening stage and abandonment of an attempt to undertake the bulk of screening by one reviewer (up to six were involved in the most time-consuming stages). Also, no evidence matrix was produced due to the small sample size. Further details have been provided in related sections of the methods.

Search strategy
A search string, consisting of subject-intervention-and outcome-related keywords combined using Boolean logic and wildcards, was used to query publication databases, search engines and online repositories (Table 3). Searches were conducted in February and March 2017. Unless otherwise indicated, searches were conducted for studies produced between 1983 and 2017 inclusive, in English only given resource constraints. Publications with fulltexts not in English were listed and recorded separately for future iterations of the map or for other interested parties to pursue (see "Results").
To ensure wide coverage of potential peer-reviewed academic publications and grey-literature publications [33], we searched three publication databases, one search engine, three online repositories, and 21 organisational websites (Table 4). Studies were also identified opportunistically, via calls for papers at major international conferences, backwards and forwards citation searches of included studies (in March 2018) and use of review papers to identify related primary studies. Subject experts were contacted to confirm no key references were missed. These individuals were identified primarily by their roles as conservation planning experts within global conservation NGOs, such as Conservation International, and prominent national organisations, such as the South African National Biodiversity Institute, and through snowball sampling (many recommended additional subject experts). Experts also included academics whom the authors knew had an interest in this topic. The detailed protocols employed to search each source, including tailored approaches to individual databases and suggestions from subject experts, are outlined in Additional file 1.

Data management
Following the searches, article citation details and abstracts were extracted in.ris form (or converted to.ris form using a freely accessible conversion template developed for our purposes by the EPPI-Centre [34]). When searching websites, the web scraping software Parsehub (https ://www.parse hub.com) was used to extract records [35,36]. Settings were tailored to each website and details are provided in Additional file 1.  [15]). A range of inputs influence the planning process. Outputs and outcomes can arise throughout the planning process but, for simplicity, we have lumped the main stages during the planning process. Different types of outputs from the planning process will lead to different types of outcomes. However, given the causal chains are not yet well understood, single arrows have been used to indicate the influence of the planning process on the types of potential outputs and outcomes. Outputs are the material or legal products of inputs, such as numbers or total km 2 of protected areas or numbers of boats available to patrol for illegal fishing. Outcomes are the observed or assumed effects of outputs, including reduced threat levels and improved state of biodiversity [58]. The feedback arrow from outcomes to inputs indicates the adaptive approach used to modify plans subject to observed outcomes Results were initially imported into the software EPPI-Reviewer (V.4.5.1.0 [37]), which was used to detect and remove duplicate articles. Prior to title screening, duplicate checked results were imported into the systematic review management software CADIMA [38].

Article screening and inclusion criteria
Articles were screened in three stages: title only, abstract, then full-text. When screening articles for relevance, a series of inclusion and exclusion decisions were consistently applied (Table 5). Regular reviewer team meetings were held throughout all screening stages to ensure criteria were being applied uniformly, and detailed decision rules were tailored for each screening stage in these meetings. During our discussions about definitions of systematic conservation planning and tightening the inclusion criteria, we revised our original positions and excluded several studies we had expected to include at the protocol development stage (from the test library, including [7,39,40]). If there was any doubt about the Table 1 Potential outcomes of systematic conservation planning arranged according to capitals. Categories adapted from the typology developed by Bottrill and Pressey [28] Representativeness was removed as an example of a natural capital outcome [58] Capital Definition Outcome sub-category

Natural
The stock and flow of goods and services provided by ecosystems, including the diversity of species, regulating processes, and supporting services [87] Reduction in loss or degradation of natural values

Persistence of biodiversity
Maintenance of ecosystem services

Financial
The gain or loss of cash, property or assets that represent the economic value of an individual or organization   relevance of an article, it was included for evaluation in the subsequent screening round to avoid removing potentially relevant studies. Two of the authors (MCM and RLP) have published extensively in this field so were not involved in article screening to reduce the likelihood of authors reviewing their own work. Prior to commencing each screening stage, interreviewer consistency checks were conducted with the reviewers involved at that stage. A subset of the same titles/abstracts/or full-texts were screened by all and results were compared using Cohen's Kappa (k) [41] (more details below). This exercise was repeated, and inclusion rules clarified, until the desired level of consistency was met (threshold k value of 0.5 at title screening and 0.7 for abstract and full-text screening). These k values exceed recommended guidelines [42]. After this, screening formally started (the articles used in consistency checks were included again in the overall screening). Articles were randomly divided between the reviewers and were not duplicate screened (except those marked as unsure, and at full-text extraction stage). Articles provided by subject experts were automatically included for screening at full-text.

Title screening
For title screening, four rounds of consistency checks were required, involving 200 randomly selected articles each time (800 titles total). Rounds were iteratively undertaken using CADIMA until screening decisions were above the k = 0.5 threshold. Publication titles were reviewed in CADIMA by two reviewers in accordance with the inclusion and exclusion criteria.

Abstract screening
Due to limitations of CADIMA (not allowing changes to the size of a reviewer team at different stages [38]), EPPI-Reviewer was used for abstract and full-text screening. Missing abstracts were entered for included articles. Six reviewers participated in abstract screening and five rounds of consistency checks were conducted, each involving 25 randomly selected articles, until results exceeded the k = 0.7 threshold (125 abstracts total).

Full-text screening
Full-text files were downloaded or accessed in hard copy through the Bodleian Libraries at the University of Oxford. Full-text articles were divided between and screened by three reviewers following six rounds of consistency checks (where 20 articles were screened by all three reviewers each round), until results exceeded the k = 0.7 threshold (120 full-texts). Reviewers included articles focusing on those with relevant subject, intervention, outcomes, comparator and study designs (Table 5). Articles were included only when all five criteria were relevant. Where insufficient information was provided to determine if the intervention met our definition of systematic conservation planning, further information was sought before reaching a decision, such as by following up cited references about the original plan. Any articles the reviewer was unsure about were flagged as 'unsure' and were discussed with the other reviewers prior to Table 3 Search terms and strings used for searching online databases and websites a Following sensitivity testing during initial pilots of the searches (in November 2016), the search terms were revised from the original protocol [15]. Terms were removed if they either inflated the number of search terms returned in an unspecified way (i.e. adding many irrelevant search results), or if they did not add value to searches based on similarity to existing terms Publication database search terms (formatted for Web of Science) a Subject terms AND Intervention terms AND Outcome terms AND Qualifier terms TS = (aquatic OR "river basin" OR ecoregion* OR bioregion* OR terrestrial OR marine OR freshwater OR coastal OR landscape OR seascape OR catchment OR "coastal zone" OR "ecological network" OR corridor OR "conservation area" OR "reserve network" OR "protected area" OR "national park" OR "planning unit") AND TS = ("conservation plan*" OR "spatial plan*" OR "conservation assessment" OR "reserve selection" OR "area selection" OR "reserve design*" OR "conservation zoning" OR "key biodiversity area" OR "important bird area" OR "spatial priorit*"OR "conservation priorit*" OR "conservation area priorit*" OR "spatial optimi*" OR "protected area network design" OR "resource allocation" OR "conservation decision making" OR marxan OR zonation OR "C-Plan" OR RobOff OR BioRap OR CLUZ OR ConsNet OR CPLEX OR CREDOS OR "Ecoseed Marzone" OR MinPatch OR MultCSync OR NatureServeVista OR ResNet OR SPEXAN OR "conservation evaluation" OR "area identification" OR "decision-support tool" OR "conservation action") AND TS = (outcome* OR evaluat* OR output* OR impact* OR effect* OR ineffective OR success* OR fail* OR benefit* OR implement*) AND TS = (biodivers* OR wildlife OR species OR habitat) Google Scholar search terms "Conservation plan" "conservation planning" "spatial plan" "conservation assessment" "reserve design" "conservation zoning"   Table 5 Article inclusion and exclusion criteria, summarised from the protocol [15] with practical considerations adopted during screening to increase clarity and consistency in the application of the criteria a A detailed discussion of what constitutes an outcome versus an impact is provided in [16] b In an interrupted time series, data are collected at several time points before and after an intervention [92] Screening This means that plans will necessarily use existing (e.g. Marxan [17], C-Plan [20] and Zonation [19]) or custommade (e.g. linear/non-linear programming, genetic algorithms) decision-support tools in the 'spatial prioritisation' stages Studies relating to plans that have no explicitly stated (or quantifiable) biological conservation objectives Studies relating to plans that were solely expert-based approaches Studies that do not involve the use of computerised decision-support tools Studies were included if they approximated the stages of systematic conservation planning in Fig. 1 (e.g. plans did not have to have been implemented), and involved stakeholder engagement, quantifiable conservation objectives, and a spatial prioritisation exercise Outcome Studies measuring changes in the condition of one or more of the following forms of capitals: natural, financial, social, human and institutional (either quantitatively or qualitatively) Broad interpretation of outcomes to capture the breadth of intended and unintended outcomes and potential flow-on consequences for biodiversity conservation a Outcomes that are not attributed to a systematic conservation planning process Studies were included if they reported on changes in the condition of one or more types of capital, as a result of a systematic conservation planning. Theoretical studies, prospective models, or studies using only ex-post modelling to estimate business as usual versus future planning scenarios were excluded, as were studies based on researcher inference

Comparator
Relevant study designs had to relate to the impact of a conservation action (e.g. baseline monitoring was not necessarily suitable) To distinguish gap analyses from impact evaluations, studies using measures of representativeness in a gap analysis scenario were excluded Opinions of the authors or unsubstantiated statements were treated as 'researcher inference' and excluded on study design reaching a decision. Given inconsistencies in the use of the term "systematic conservation planning" [16], those studies for which the "intervention was similar to, but not systematic conservation planning" (according to our definition, Table 5) were marked as such. This provided estimates of the number of spatial conservation prioritisations and other related exercises detected. Review papers were also set aside, and the studies therein were assessed for relevance.

Data coding and mapping
Relevant studies were extracted from the included articles (where multiple studies were included in a single article). For example, a single paper often reported on multiple planning instances. The included studies were categorised by two reviewers according to a data coding template (Additional file 2) and any differences in coding were discussed and a third reviewer involved if necessary (undertaking kappa analysis was unnecessary at this stage due to the small sample size). Coding involved outlining bibliographic details, background information including location of study and broad objectives of the plan, study design, reported outcomes and context [15]. Information about the study design of relevant articles was recorded to determine study robustness, in place of critical appraisal. Where possible, the categories of the data coding template were updated to match those underlying a new database of marine spatial prioritisations [26] to make the two databases cross-compatible. Where necessary, we contacted authors of included studies for additional information (via email or phone). Our finding of fewer than expected evaluations in the published literature meant the development of an evidence matrix and geographic map of included studies (as proposed in the protocol) was not possible. Instead we provided a narrative assessment of the available literature, gaps in our current state of knowledge and suggestions for how to fill these.
Procedural independence was managed by excluding co-authors with publication records in systematic conservation planning (MCM and RLP) from the screening process.

Primary findings
In total, 15,054 results were retrieved from the 29 sources searched, including 5228 duplicates ( Fig. 3; details in Additional file 1). A further 232 previously unscreened articles were screened based on forwards and backwards citation searches of the included studies (Additional file 3). After duplicates were removed, a total of 10,058 articles were screened by title, 5221 by abstract and 1209 by full-text. Of those included for full-text screening, 236 were either not in English or not accessible (Additional Reasons for the exclusion of the remaining 970 articles were recorded (Additional file 5). Seven articles were excluded outright because they were marked as reviews of systematic conservation planning evaluations (Additional file 6). If an article was excluded on one or more inclusion/exclusion criteria ( Table 5) it was not included. All criteria were assessed, and most articles were excluded based on multiple criteria (Fig. 3). 893 articles were excluded on intervention, 186 of which were deemed "intervention is similar to, but not systematic conservation planning", meaning they were either solely spatial conservation prioritisations (a computational component of systematic conservation planning), or approximated the stages involved in systematic conservation planning but did not involve the spatial conservation prioritisation stage [15].
Of the 69 studies included on subject and intervention (representing a systematic conservation planning intervention at a relevant scale, Table 5), 23 were excluded on outcome because they reported on the design and implementation of a systematic conservation plan but did not provide any details about consequences. In some cases, for example, at the time of publication it was too early to report whether the plan was implemented. This left 43 articles included on subject, intervention and outcome but excluded on study design and comparator. Only three articles were included based on all criteria (Additional file 7). One of these had been provided by a subject expert and the other two had been retrieved through publication database searches. One of the included articles reported on multiple different types of planning instances, but we extracted three relevant studies across the three articles.
Additionally, 142 articles were included on subject, study design and comparator but excluded on intervention and outcome, all of which related to formal evaluations of environmental management interventions, but not systematic conservation planning specifically.

Robustness of existing evidence
Coded data extracted from the three included studies is presented in Table 6 according to a standardised data coding template (Additional file 2). These impact evaluations were undertaken by NGOs and universities. In two studies, the same organisations undertook the evaluation as had developed the plan(s) in question.
Fisher and Dills [43] reported on ecoregional assessments conducted across the USA, which were based on systematic conservation planning principles [44]. The authors explored the relationship between terrestrial areas prioritized for biodiversity conservation by an environmental NGO (The Nature Conservancy, TNC), and those acquired for protection by the NGO over several decades.
Lagabrielle et al. [45] outlined a terrestrial planning process for the island of Réunion which was conducted in parallel with the revision of a regional development plan. The evaluation approach was largely reflexive, comparing a planning attempt involving systematic conservation planning (which they referred to as sequence 2) and another where agent-based modelling and companion modelling were used to explore future land-use change scenarios (sequence 3).
Álvarez-Romero et al. [46] compared seven marine conservation planning exercises undertaken over a 15-year period in the Gulf of California, Mexico. One of these plans met our definition of systematic conservation planning (the Ecoregional Assessment [47]) and it alone is the relevant study discussed here. Experts on regional marine conservation issues were surveyed and asked to identify planning goals, the extent to which these were achieved and how planning outputs influenced implemented conservation actions.

Characteristics of the current evidence base
Two studies were conducted at subnational scales and one nationally. The areas of interest ranged from 2500 km 2 to over 300,000 km 2 . Two studies concerned the terrestrial realm and one marine. We classified all three intervention types as aiming to 'identify priority conservation actions' . None were intended for direct application. The objectives of all studies included biodiversity, ecological processes and species persistence and two also included other considerations, such as fisheries, agriculture and urban planning. Stakeholder engagement most commonly involved consultation, and in one instance also negotiation. The duration of the planning processes and associated costs were unclear in all three studies.

Types of outcomes of systematic conservation planning exercises
All included studies reported institutional outcomes, two also reported on social and human outcomes, and only one reported any financial outcomes. None reported natural capital outcomes. Examples of outcomes included sharing of knowledge between stakeholder groups and a greater awareness of the complexity of urban planning amongst participants [45] and influence of the planning  "Biodiversity conservation and natural resource management: Promote a regional focus in marine coastal conservation and management; provide a detailed portfolio of priority areas that represent the diversity and distribution of species, natural communities, and ecological systems of the ecoregion. Also, contribute to the knowledge of biodiversity of marine and coastal environments, and facilitate the definition and implementation of conservation strategies" "In line with the current and future development challenges in Réunion Island, the operational objectives of this study were (i) to identify priority areas for conservation (ii) to provide guidelines for implementing conservation actions outside existing reserves while dealing with increasing pressuring factors in the lowlands; (iii) to "accompany" the conservation sector to negotiate land-use planning and decisionmaking, more particularly in relation to the new regional land-use plan and the management plan of the National Park, and (iv) to explore alternative scenarios for land-use and conserva-

Included study
Overview of the methodology "The lands acquired by The Nature Conservancy (TNC) were analysed using GIS to determine to what extent they were in areas defined as priorities for conservation" Seven plans conducted in the Gulf of California were compared and experts were asked to assess their outcomes based on a standardised questionnaire. "…The similarities and differences between planning exercises were examined in terms of data, methods and outputs, how identified priorities match the existing MPA system, and whether plans have guided conservation and management actions" The evaluation approach was largely reflexive, comparing planning sequences 2 (involving Marxan) and 3 (involving model co-creation with stakeholders), and based on "observations made by the participatory modelling investigators during and 12 months after the process". The authors considered the "researcher's posture in the participatory modelling process" and therefore attempted to recognise potential biases

Included study
Purpose/rationale for the study (stated reasons for undertaking an evaluation) "The Nature Conservancy (TNC) and other large conservation organizations have invested substantial resources in developing conservation plans intended to guide their decisions about which land areas and bodies of water to conserve. However, despite the investment in developing a scientific method for prioritizing areas for conservation, the degree to which land acquisition actually follows these scientific priorities has not been investigated before now" "While theory in conservation planning is developing quickly, there has been no assessment of the influence of new ideas on applications of marine conservation" "…The overall goal was to test different approaches to bridge the scientific and operational communities by bringing multidisciplinary scientists and stakeholders to collaborate around the participatory development of spatial models for land-use and conservation planning" Hypotheses of evaluators "Our first hypothesis was that overall the acquisition of lands should be well aligned with priority areas on the assumption that TNC chapters base their acquisition decisions on the best available conservation science. We did not expect perfect alignment for several reasons noted in the discussion section. Second, we hypothesized that there would be improvement over time in the match between science-based priorities and land protected by TNC as assessments and planning methods were increasingly formalized and improved. Our third hypothesis was that outright fee simple acquisition of land would show greater alignment with the priority areas than procuring conservation easements"

Not provided
Not provided Outcome pathways Theory of change or conceptual model (for how the plan was expected to lead to intended outcomes) included in the study?

No
No No process on future decision making by the organisation or partners [43,46]. Two of the three studies reported on whether the project outcomes reflected achievement of the original plan vision statement, and both reported the vision had been achieved.

Types of study designs used in evaluations of systematic conservation planning
None of the studies provided theories of change or a discussion of how they expected the plans to lead to potential conservation (or other) outcomes. Only one stated a hypothesis for the evaluation. Two studies involved nonexperimental study designs where the method of attribution was correlative. The other was qualitative, and attribution was based on researcher inference.

Review of our search methodology
One of the included studies [43] originated from the call to subject experts, but unlike the other included studies, had not been returned in the searches of publication databases (all included studies had been published in indexed journals). To explore the reasons for this, a review of the original search string was conducted in Web of Science Core Collections on 24/01/18 (Additional file 8). This confirmed the intervention and outcome search terms were appropriate, but the subject terms were not sufficiently diverse. By adding the term 'protection' to the subject terms, the missed article [43] was returned in the new search, along with 342 additional articles compared to the original search string (excluding a further 563 articles added to Web of Science in the 11 months since the original search). This finding can be used to help better design future searches (see "Discussion"). However, the 342 studies were not screened for relevance and none were included in the study since we would for parity necessarily have to have to repeated searches across all 29 sources beyond the original March 2017 cut off.

Discussion
Faced with prioritizing limited resources for biodiversity conservation, conservation planners are increasingly turning to systematic conservation planning tools and techniques. Their aims are to explore financially and socially acceptable trade-offs, whilst seeking to optimise representation and, usually implicitly, the persistence of species and habitats [2]. In this study, we collated articles on the application of systematic conservation planning and the outcomes of related plans. The aim was to assess the evidence base rather than produce metaanalyses or detailed syntheses. Despite retrieving over 10,000 articles from traditional academic and grey literature sources, only three studies were found to contain robust evaluations of this extensively applied intervention type and none that reported evaluation of natural capital outcomes. This highlights an important evidence gap, particularly given the amount of interest in systematic conservation planning and the significant cost of undertaking plans [28]. This is a null result, rather than a negative finding and was not completely unexpected, given the barriers to undertaking evaluations of complex interventions [48]. As stated in our original protocol, "a finding that no or few impact evaluations have been undertaken on systematic conservation plans would highlight an important gap in evaluations of the technique to date" [15: 4]. Rigorous Collaboration for Environmental Evidence guidelines [42] were used to identify three relevant studies, that provide valuable insights into how plans are conducted and how outcomes can be interpreted. However, the studies did not conform perfectly with our inclusion criteria, particularly in relation to comparators and study designs. The below considerations highlight the difficulties of interpreting studies of planning interventions, as well as the prevalence of incomplete records and challenges when attributing outcomes to complex interventions.

Contrasting evaluation methodologies
The three included study designs differed greatly and demonstrated the importance of understanding regional contexts and of interpreting results with care.
Fisher and Dills [43] undertook a meta-analysis, the assessment of a planning campaign over several decades, rather than assessments of individual plans and the causal processes that led to the outcomes of those plans. The authors were not able to provide conclusive evidence that systematic conservation planning influenced land acquisition decisions, but this finding masked the complexity underlying the value of the formal ecoregional assessment processes. While they did find that the land acquisition patterns by TNC were positively correlated with priority areas in all states across the USA, no difference was observed in land acquisition patterns before and after systematic ecoregional assessments were implemented.
Rather than a before/after analysis, Álvarez-Romero et al. [46] compared seven different approaches to planning in a single region, five of which approximated systematic principles although only one met our definition of systematic conservation planning. The plans overlapped spatially and temporally to some degree, and the authors employed both quantitative and qualitative data collection methods. The six plans that did not meet our definition of systematic conservation planning are not ideal counterfactuals for the relevant plan given that datasets and experiences from earlier plans often influenced subsequent planning processes. Furthermore, these plans were not explicitly contrasted to the relevant plan in the results, as data were aggregated across all seven plans. Use of a survey methodology facilitated an assessment of how goals, outcomes and spatial areas recommended for protection differed between the seven plans. Surveying people one step removed from the plans in question provides an example of how to limit bias when reporting on planning outcomes.
The third included study by Lagabrielle et al. [45] was considered borderline in terms of constituting a qualitative study design in part because it directly reported on the opinions of the authors, with little explanation of causal links or justifications for reported outcomes. We included it because the authors considered the "researcher's posture in the participatory modelling process" [45:1425] and therefore attempted to recognise potential biases. They provided thoughtful reflections on the scientists' role in the process, and comparisons between the experience of conducting a systematic conservation planning study using Marxan, with a process where stakeholders co-created models and planning outputs with scientists.
Another article we screened, by Carter et al. [49], provides a valuable example of how to conduct an evaluation of the outcomes of conservation plans, even though it was not included. The authors quantified land management actions (i.e., changes in the amount, location and land cover type of protected areas) in relation to individual state land management plans in Wisconsin, USA. They found that land protection activity increased in prioritized regions after plans were released, an effect that was high for local land protection projects but weak for broader, state-wide plans (a finding consistent with that of Fisher and Dills [43]). They also noted that most actions occurred within the first 5 years after a plan was released and decreased over time. Two plans discussed in this article were similar to systematic conservation plans but were excluded on intervention and outcomes. It remains an insightful article, exploring the causal pathways by which plans may influence conservation actions.
Overall, we identified 43 studies with relevant subjects, interventions and outcomes, but which we were unable to include due to unsuitable study designs and comparators (Additional file 2). In general, the evaluation strategies were not sufficiently rigorous for inclusion, or to independently verify claims. This is not uncommon in studies of environmental interventions [50,51]. Despite suggestions that conservation experts are unaware of how to conduct high quality evaluations [51], recent research suggests the main barriers to undertaking such studies relates to a lack of funding and time constraints, as well as availability of baseline data, lack of forward planning, and availability of a suitable control group [48].
Four studies were excluded by comparator only [7,31,52,53]. Authors of similar systematic maps have occasionally been flexible on screening by comparator [54], noting it is particularly unusual and challenging to employ use of comparators in multidimensional interventions such as planning. Instead we have been clear about the fact that our included studies do not perfectly match the inclusion criteria and list our reasons for exclusion (Additional file 5).

Lack of clarity around intervention definitions
The plans underlying all three included studies varied in the degree to which they met our definition of systematic conservation planning. For example, in Fisher and Dills' study [43] the methods underpinning TNC's ecoregional assessments [44] draw heavily from the systematic conservation planning literature. The use of spatial conservation prioritisations and computational decision-support tools is promoted by TNC, but is not mandatory, in contrast to our definition of this intervention (Table 5). We were not able to determine what proportion of ecoregional assessments in the included study involved the use of computational tools, and the authors acknowledged this type of information was not always available.
Very few full-text articles met our specific definition of systematic conservation planning (n = 80). Our definition is heavily drawn from the academic literature and may be better interpreted as a set of guidelines, rather than a distinct intervention. Groves and Game [1] recommend that conservation planners decide on which tools to include within a planning process depending on their specific needs, available funds and teams' skill sets. It appears that this is a better representation of how planning processes are conducted in practice than the 11-step framework we used to help define the intervention (Fig. 1). The latter may be better interpreted as a list of components from which planners pick and mix. However, expanding a study of this nature to include all conservation planning studies that are systematic to some degree would introduce high levels of variation, making comparisons extremely challenging.
For these reasons, it may be appropriate to review strict definitions of systematic conservation planning, including our own. Alternative conceptualisations of conservation planning as an intervention may be required. In addition, research is warranted on the relative benefits and limitations of following some, rather than all, planning stages of outlined in Fig. 1.

Predominance of publications on prioritisations rather than planning
This systematic map illustrates how rarely evaluations of systematic conservation plans are published. Yet, despite their novelty, none of the three included studies was published in the most common journals associated with the discipline. According to a review conducted in 2012 [2], the three journals with the greatest number of articles on systematic conservation planning are Biological Conservation, Conservation Biology and Biodiversity and Conservation. Citation searches in Google Scholar revealed our three included studies have been cited between 18 and 65 times over the 5-8 years they have been in publication. This is low in comparison with a seminal paper on systematic conservation planning, cited over 4500 times since 2000 [9].
Our results corroborate claims that the literature on systematic conservation planning is dominated by methodological studies, rather than those that focus on implementation and outcomes, and support the case that this is a problematic imbalance in the literature [55]. Spatial conservation prioritisations made up the majority of the 186 studies we marked as "intervention is similar to, but not systematic conservation planning". These were generally undertaken as hypothetical exercises or to propose a methodological innovation, rather than being linked to a broader planning and implementation strategy. Given estimates of the number of published prioritisations (e.g. 160 in the marine realm alone [26]), our finding of 186 related studies is lower than expected. This may be because we included evaluation terms in the search strings and excluded articles that were clearly only spatial prioritisations at the abstract screening stage.
Several authors acknowledged that they specifically decided not to conduct spatial conservation prioritisations as part of their (otherwise systematic) planning processes [56,57]. Kirlin et al. explained "this technique [use of Marxan software] was explicitly rejected in the initiative as inconsistent with the legal requirements… regarding network design and not sufficiently transparent to policy makers or stakeholders" [56:4]. Lagabrielle et al. also stated that they had to abandon a planned aspect of stakeholder consultation alongside their development of prioritisations in Marxan because "…this tool embeds strong hypothesis about land-use management and conservation, such as, for instance, the attribution of a value to biodiversity features. The participants globally disagreed with this approach and thus rejected the tool" [45:1424].
Another insight revealed in this study was the high number of plans that focused on reporting the representativeness of protected area networks (or similar) rather than measures of species' persistence or other measures of impact (and therefore had to be excluded). A protected area network might contain representations of each habitat type, but still fail to ensure species persistence, perhaps because of the inadequate size of, or connectivity between, habitat patches or inadequate representation of the habitats most in need of protection [58]. Too narrow a focus on representativeness in systematic conservation planning risks plans failing to secure healthy populations and species as intended.

Comparison with related reviews of systematic conservation planning
In a recent survey, conservation planning practitioners reported much higher rates of plan implementation and downstream outcomes than is observable in the published literature [59]. This finding is supported by unpublished outcomes provided by the lead author of one of the included studies "Our research worked so well (in terms of impacts) that I was recruited at the university of Réunion Island and the PhD co-student I worked with… is now in charge of developing and monitoring official regional land-use and risk management plans for the Regional Councile of Réunion Island" (Personal communication, Erwann Lagabrielle via email; 4 December 2017). Following up related studies (like Lestrelin et al. [60] in this case) can also indirectly demonstrate how institutional approaches to planning and stakeholder participation evolve following a systematic conservation planning exercise and how new knowledge and data can benefit subsequent decision-making. However, it is often necessary to ask authors which studies are directly linked.
In contrast to a recent review on the same topic [61], albeit more limited in scope, our results suggest there is insufficient evidence to claim whether systematic conservation plans are or are not achieving conservation goals. Through application of rigorous systematic mapping methodology, we identified two relevant terrestrial studies which the authors of that review did not appear to locate. However, we concur that more detailed examinations of how plans are implemented in specific regions would be useful and reaffirm statements made by Bottrill et al. in 2012 [31], "Empirical evidence is not available to support the belief in the benefits of planning".
Other systematic maps and reviews in the environmental sciences have also reported few or no relevant studies, despite also focusing on widely applied, and well-funded disciplines [62][63][64]. It is important to share such results to illuminate knowledge gaps, limitations with study quality and to avoid duplicating enquiries or assumptions [65].
It is useful for practitioners and decision makers to know when an intervention has not been evaluated to support claims of effectiveness. However, the lack of a consensus on the outcomes of systematic conservation planning does not imply there is no evidence at all. Evidence can take multiple forms, including anecdotal observations from subject experts. For example, the 43 studies which reported the outcomes of systematic conservation planning interventions but which did not meet our rigorous inclusion criteria for study design and/or comparator (Additional file 7) are still valuable to decision makers [66]. Examples of anecdotal outcomes included; "…the initial conservation planning map… has already been used…to block proposed afforestation permits that would have destroyed an area of high conservation value…" [67: 10], and "…pooling of resources, expertise, and capabilities was one of the enabling features in delivery of the new zoning plan…" [7: 1742].

Comprehensiveness of search
The three included studies were located from different sources, reinforcing the importance of a multi-pronged strategy. The fact that experts in the field confirmed they were not aware of other published studies provided further support for our results (Additional file 1). Therefore, we doubt the inclusion of additional sources, non-English publications or contact with more subject experts would have made a major difference to our results.
A test library of eight studies we had expected to be included was used in the design and testing of the search string (Additional file 1 in the protocol [15]). This test library was also used to test the comprehensiveness of the search results returned after sources were searched and duplicates removed. Six of the test library articles were returned by our searches, two were not. One was an organisational report [68] so does not appear to have been accessible through the grey literature sources we searched. The grey literature is necessarily harder to search (e.g. some databases we hoped to include did not facilitate the mass exporting of search results, Additional file 1). The other test library article [69] appears to have included subject terms other than those in our search string (this issue is addressed further below).
Future revisions to this search strategy should consider broadening the search string. A keyword we recommend for inclusion is 'ecoregion' or 'ecoregional assessment' , a term used by The Nature Conservancy to encompasses systematic conservation planning. These terms were trialled and not included in the search strings because they did not initially appear to contribute additionally to search results. Many ecoregional assessments were returned through our searches, but others may have been missed.
Limitations with our chosen subject terms apparently arose from the discipline of systematic conservation planning being relevant to a diverse array of subjects. Without significantly expanding the subject search terms to include a potentially limitless spatial terms (e.g. 'protection' , 'natural resource' , 'forest' , 'pasture' and so on), relevant studies may be missed. Excluding the qualifier search terms also works to broaden the results, but a careful balance is required to avoid returning too many irrelevant search results.
There is a small risk that literature that does not conform with the academic jargon around systematic conservation planning was overlooked during screening. However, our inclusion and exclusion criteria (Table 5) were specifically designed with this risk in mind, focusing instead on key characteristics of plan design and implementation.

Efficiency in the review process
During the process of conducting this review we experimented with various technological approaches to increase the efficiency of screening thousands of articles. This included trialling specialised software to manage the screening process [38]. Automated duplicate checking, and web scraping also led to significant time savings. Despite progress in the use of machine learning to help automate abstract screening [70,71], we concluded this technology is not sufficiently developed to have been applicable without further testing. Innovative methods are much needed, particularly as the size and scope of maps and reviews continue to increase [72,73].

Implications for policy and management
The lack of rigorous evidence for the impacts of systematic conservation planning is of considerable concern. Additional studies are urgently needed to understand the work of governments and environmental NGOs applying these methods around the globe. A strong assumption from theory is that systematic conservation planning is more effective at conserving species and habitats than alternative approaches to allocating resources for conservation (be they ad hoc, driven by extractive use considerations, or based primarily on stakeholder negotiation). Given the focus on evidence, it is consistent with the epistemology of the discipline to ask whether there is evidence for effectiveness, and to suggest ways in which such evidence might be made more easily available. However, there are many theoretical and practical reasons to believe systematic conservation planning is preferable to alternative approaches (or doing nothing) [74]. The race is on to protect sufficient areas of the land and ocean, increasing the importance of core systematic conservation planning principles like representation of species and habitats, clear objective-setting and well-designed stakeholder engagement [11].

Implications for research
In this study, we confirmed a growing evidence-base for the suitability of different methodological approaches for spatial conservation prioritisations [75,76]. We also found some evidence of how systematic conservation plans are being implemented, including lessons learnt [8,67,77,78]. However, rigorous impact evaluations are lacking. A full set of guidance is likely beyond the scope of this study, but some recommendations are provided below.
To improve the quality of future evaluations of conservation plans, we suggest conservation planning organisations provide incentives and time for staff to write up their findings and make them publicly available (even if not in an academic journal). In addition, conservation planners and academics would benefit from collaborations to leverage the additional resources and skills required to complete evaluation studies. Long-term ecological monitoring and reporting combined with adaptive management [79] and standardised metrics of management effectiveness [80,81] are extremely valuable. More examples of ways to improve evaluation in systematic conservation planning are offered by McIntosh et al. [16].
To improve project documentation, we suggest including: (a) explicit statements about the objectives of a systematic conservation planning process and quantitative or qualitative evidence for whether those objectives were met; and (b) detail about the intervention i.e., how the planning process was conducted, to enable readers to determine whether the process has the attributes of a systematic conservation planning project. To improve evaluation documentation, we suggest including: (a) theories of change about how specific aspects of the planning process were expected to lead to particular outcomes (and where possible, whether results led to a modified understanding of the theory of change); (b) clear descriptions of the study design (with reference to existing classifications e.g. experimental or qualitative sampling [82]) and discussion about any limitations with the chosen study design; and (c) where possible, presentation of a range of perspectives on the outcomes and potential causal links with the planning process.
When designing an evaluation in these contexts, it is preferable to focus on improving the minimum standards of evaluation, rather than expecting to achieve 'best practice' evaluation methodologies [48,83]. Based on our included studies and related literature on barriers to evaluation [48], the most practical counterfactual study designs for conservation planning are likely to include comparisons before/after the planning process.
As a minimum, authors should identify potential alternative explanations (external to the planning process) for observed outcomes and explain their understanding of the relative importance of different potential causal processes. Examples include stakeholder perceptions of the relative importance of a local election in promoting political action, or a natural disaster having reduced interest in planning versus on-ground action. For more see McIntosh et al. [16]. Given that systematic conservation planning constitutes a form of 'dynamic planning' [77], dynamic approaches to evaluation may also be necessary. This could include elements of development evaluation, where an evaluator is integrated with the design team [84].
These styles of studies are not impossible to undertake or report on. An increased uptake of before-after and with-without designs in the conservation policy field has been reported by conservation experts [48]. In one of our included studies, a temporal comparator was employed using historical land ownership records. This type of analysis is likely to be feasible in many conservation planning scenarios, particularly where the use of experimental study designs and controls would not be cost effective or ethical.
During full-text screening we encountered several interesting study designs and uses of comparators that have been undertaken in related disciplines. For example, qualitative case study evaluations compared different planning approaches in a single region [45,85], and another compared two neighbouring regions, one that participated in a community zoning project, and another that did not [86].
In the short term, it is unlikely that enough new studies will be produced to warrant a revision of this systematic map. Future research may consider exploring the evidence for the effectiveness of specific aspects rather than evaluating systematic conservation planning in its entirety (e.g., stakeholder attitudes of consultation processes).

Overall conclusion
Remarkably few rigorous evaluations of systematic conservation plans have been conducted to date, despite many claims about their effectiveness. This does not imply systematic conservation planning is, or is not, effective, but highlights the gap in our understanding of how, when and why it may or may not be effective. It also raises important questions about the challenges of conducting rigorous evaluations in relation to a non-linear and multi-dimensional intervention such as conservation planning. We have provided some suggestions as to how these challenges can be overcome. We recommend more focus is required in this area. We urge academics