Biodiversity Data Journal : General research article
PDF
Integrating Biodiversity Data into Botanic Garden Collections
Thomas Horn
‡ Molecular Cell Biology, Botanical Institute, Karlsruhe Institute of Technology, Kaiserstraße 2, 76128 Karlsruhe, Germany
Open Access

Abstract

Background

Biological research always evolves around a form of life, an individual or a group that belongs to a certain taxonomic affinity. Species as fundamental entities of biodiversity are described, grouped and reorganized. The stream of research documentation is frequently interrupted by new discoveries, new technologies and subsequent new evidence on the delimitation of certain taxonomic groups. To recover all relevant information for future research, the assessment of botanic garden collections, the identification of potentially invasive taxa, the construction of priority lists or the development of DNA based species authentication, plant genetic resources of botanic gardens and similar facilities can be improved by connecting them to a dynamic web of knowledge.

New information

Scientific names extracted from seed catalogues provided by 135 botanic gardens were evaluated using the Encyclopedia of Life (EOL), The Catalogue of Life (COL) and The Plant List (TPL). 98.5% of taxon names were verified and discrepancies of taxonomic status among providers were evaluated. Retrieving geographic information COL appeared as most valuable source while EOL was a major contributor primarily of Asian and American occurrences. A compiled flora database made similar contributions as COL, EOL and TPL. Only 7 % of the verified names were found to be included in the International Union for Conservation of Nature (IUCN) Red List, including one officially "extinct" species (Euphrasia mendoncae Samp.) and three taxa (Bromus bromoideus (Lej.) Crép., Lysimachia minoricensis J.J.Rodr. and Mangifera casturi Kosterm.) with the status "extinct in the wild". As second most important factor for biodiversity loss, potential invasiveness was determined using the Global Invasive Species Information Network (GISIN) and Delivering Alien Invasive Species Inventories for Europe (DAISIE). Approximately 4 % of the verified names were found in GISIN, 577 names representing introduced (exotic) and 183 invasive taxa. According to DAISIE around 20 % of the verified names are representing European alien taxa. 15 of the 18 worst European invasive plant taxa also were found among the verified taxon names.

Keywords

Botanic Gardens, Conservation, Invasiveness, Encyclopedia of Life, Catalouge of Life, The Plant List, Biodiversity, Bioinformatics

Introduction

Before the introduction of binomial nomenclature scientific names were intended to combine identity and diagnostic description of a species. While dealing with a small group this may have been feasible, for the aim of cataloguing complex groups it proofed to be rather impractical. Starting with Specium Plantarium [Linné and Salvius 1753] a scientific name has been a unique label linked to a (morpho)species supported by a separate diagnostic description.

As consequence of progress in taxonomic research the arrangement of organisms is modified subsequently changing or invalidating a scientific name and creating discontinuities among documented empirical data. Thus, the history of a scientific name contains relevant information for future research projects. An entry point for discovering the history of a taxon is a current species checklist which includes synonyms that are the milestones along the road to the present name.

Data repositories

As to collate a uniform and validated index to the world's known species the Integrated Taxonomic Information System (ITIS), a partnership of federal agencies and other organizations from the United States, Canada and Mexico, with data stewards and experts from around the world and Species 2000, an autonomous federation of taxonomic database custodians, involving taxonomists throughout the world, joined forces to setup The Catalogue of Life (COL) in 2001 further developing it [DP 2009] up till today. As of October 2014 it contained 1.5 million accepted and provisionally accepted taxonomic names (341 thousand of Plantae). With a similar agenda but focused on plants the Royal Botanic Gardens Kew and the Missouri Botanical Garden enabled the creation of The Plant List (TPL) combining multiple checklist data sets held by these institutions and other collaborators in 2010. As of September 2013 and the release of version 1.1 TPL contained 1.3 million scientific plant names of which 351 thousand are accepted species names.

With sufficient information on the establishment of the accepted scientific name the story of a species unfolds and literature can be screened for relevant information, including names that are no longer in use. Beyond literature there are freely available scientific data repositories of different kinds containing additional information. Members of the International Nucleotide Sequence Database Collaboration (INSDC) have been collecting and providing sequence information for 30 years accumulating about 178 million sequences of 340 thousand species and infraspecific epithets. The Barcoding of Life Datasystems (BOLD) supporting the generation and application of DNA barcode data, as of May 2015 offering over 4 million DNA barcode sequences supporting specimen identification. The Global Biodiversity Information Facility (GBIF) provides a single point of access to more than 500 million records, shared freely by hundreds of institutions worldwide, making it the biggest biodiversity database on the Internet. Names are also critical when building priority lists, e.g. the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES), the International Union for Conservation of Nature (IUCN) or invasive species lists (e.g. Global Invasive Species Information Network, GISIN or Delivering Alien Invasive Species Inventories for Europe, DAISIE).

The vision of building a species database combining names with all kinds of useful data [Wilson 2003] was realized after E. O. Wilson’s 2007 TED Prize speech by the release of the Encyclopedia of Life (EOL) in February 2008. Connected to scientific names derived from different sources are common names, diagnostic descriptions, media (images, sounds, videos and maps), ecological, geographical, molecular and usage data.

All mentioned data providers offer a website where the user can search for respective information. Some of them also offer the possibility to retrieve information through an application programming interface (API). By using dynamic high-level general-purpose languages (e.g. Perl, Phyton, PHP and others) stakeholders can include respective data in their own (web) application. Additionally, scientists are able to retrieve and analyze data from many different taxa in little time by using either the API or third party software that facilitates the API (e.g. Banbury and O'Meara 2014).

Botanic Gardens

The primary source for all the described information are, nevertheless, physical beings - individual members of respective species that are found in their natural habitat, cultivated on experimental fields or in botanic gardens.

Using science as fundamental criteria for the definition of a botanic garden, the first European botanic gardens were established in the mid-16th century [Palmer 1985] and focused solely on the study of medicinal plants. The model of such a "physic garden" became popular and resulted in the recreation throughout Europe in the following decades. In 1621 the University of Oxford botanic garden (Oxford BG) was the first to be established in the United Kingdom (www.bgci.org/resources/history/).

Today the role of botanic gardens is much more diverse. Support of scientific research and economic endeavors (e.g. Centre of Economic Botany at Royal Botanic Gardens Kew, founded in 1847 [Desmond 1995]), involvement in education, public relations, improvement of human well-being [Waylen 2006] and plant conservation [Ashton 1988, Hurka 1994, Maunder et al. 2001, Donaldson 2009] are some examples.

Botanic Gardens Conservation International (BGCI) is the world authority on botanic gardens and plant conservation. It represents about 700 members, mostly botanic gardens, from 118 countries. A traditional practice among botanic gardens is the exchange of plant (genetic) resources by annually offering seed catalogues (indices seminum) from which other gardens can order to develop their own collection. This practice is believed to have started in the late 16 nth century at the Oxford BG [Aplin et al. 2007] and has recently been critically discussed [Aplin and Heywood 2008, Heywood 1964, Howard et al. 1964]. A more recent development came with the dawn of the information age. Some gardens now provide access to their botanic collections through a web interface and interested parties can request material for their collection or research online. As a result of the Convention on Biological Diversity (CBD) the International Plant Exchange Network (IPEN) was endorsed by the EU-Consortium and approved by EU-gardens as model to meet the requirements of the CBD on access to genetic resources and benefit-sharing [Robbrecht and Bogaerts 2004].

The possibility to develop a collection or scientifically utilize respective species without the need for expensive expeditions and simultaneously complying to the CBD is appealing. However, the exchange and cultivation of plant species also has less favorable consequences. The introduction of exotic species by botanic gardens has frequently been associated with the potential for escape and evolution of invasiveness [Hulme 2011, Maunder et al. 2001]. The tremendous ecological and economic effects invasive species have [Pimentel et al. 2000CBD 2001] stimulated governments around the world to establish preventive measures (e.g. European Union Regulation No. 1143/2014, United States Federal Law: National Invasive Species Act). Other consequences of cultivation that are of great importance for species conservation with the ultimate goal of reintroduction are a range of threats to the genetic structure of respective species [Maunder et al. 2004,Husband and Campbell 2004].

Aim of the study

The aim of this study was to gain an overview on the currently available genetic resources offered by botanic gardens using taxonomic names of seed catalogues, to gather relevant information (i.e. taxonomic status, IUCN Red List status, geographical distribution and invasiveness reports) and to evaluate the information and its sources in the context of improving facilities dealing with plant genetic resource.

Material and methods

Botanic gardens hold a vast array of genetic resources (BGCI. 2015. PlantSearch online database. Botanic Gardens Conservation International. Richmond, U.K.) that is freely accessible for research and education. I analyzed the content of electronically available seed catalogues (i.e. indices seminum, IS) from 135, mostly European, botanic gardens (Suppl. material 1, Fig. 2).

Figure 1.

Schematic representation of the analysis: All extracted taxon names were compiled into a unique name list (UNL). Based on this list taxon verification using EOL, COL and TPL was performed yielding separate unique identifier lists (UIL) containing all verified taxa with source specific identifiers. The UNL was also used for the retrieval of information on conservation (IUCN Red List), invasiveness (GISIN, DAISIE) and geographic distribution (Flora database). The UIL was used to retrieve information on geographic distribution directly via EOL, COL and TPL.

Figure 2.

Origin of the 135 botanic gardens (Suppl. material 1) of which seed catalogues were used to extract taxon names.

Data preparation

PDF documents were converted into XML, parsed to extract taxa names which then were saved into a local database (IS Taxa, Fig. 1). The parser was restricted to names of species, subspecies, varieties, forms, cultivars and hybrids of the form "Genus x species". In a preliminary study of names contained in seed catalogues data from TPL was stored in a local database. This database was complemented by importing all names available in csv format from the TPL website to be used in this study.

Taxonomic status of names and discrepancies among data providers

To verify the existence and to retrieve the taxonomic status of each name I used data from TPL (The Plant List Version 1.1, available from http://www.theplantlist.org/), EOL (Encyclopedia of Life. Available from http://www.eol.org. Accessed March 2015) and COL (Catalogue of Life. Available from http://www.catalogueoflife.org/, Roskov et al. 2014). EOL and COL provide application programming interfaces (API) that I used to access the data. TPL on the other hand does not offer a true API thus was accessed through a locally installed database.

Name status information was retrieved by respective API functions summarized in Table 1. Since the status terminology of data providers was different I mapped the returned values to the status scheme used by TPL (Table 2).

Table 1.

API functions and their data output format of respective data sources.

Source

API function

Output format

EOL

eol.org/api/search/1.0.json

JSON

eol.org/api/pages/1.0/[pageId].json JSON
eol.org/api/hierarchy_entries/1.0/[taxonConceptId].json JSON

TPL

local mysql database

SQL

COL

www.catalogueoflife.org/webservices/status/query/

XML

Table 2.

Statuses of taxon names provided by The Plant List (TPL) and other sources. *1 In case the taxon name searched for with EOL is returned in the title of the result, the name is the preferred name (set by the EOL curators). In case the name is returned in the content of the result, the name is considered a synonym.

TPL Status

Source Status

Source

Accepted

Preferred (Title*1)

EOL

Accepted name

COL

Provisionally accepted name

COL

Synonym

Not-Preferred (Content*1)

EOL

Synonym

COL

Unresolved

Misapplied

In detail, the retrieval of status information was done in two steps. First, only exact matches were used to retrieve the taxon status. In case of EOL the parameter exact was set to true. For CoL and TPL only exact matches are returned by default. In case more than one exact match was returned the respective status was set to “ambiguous”. Secondly, all names that did not return any result (NA) were re-evaluated. In case of EOL, firstly, the parameter exact was set to false (default) and, if still no result was returned, the name of the next higher hierarchical level (i.e. species in case of infraspecific epithets or genus in case of species names) was searched for. If found, the respective taxon page was used to retrieve all available names of children (i.e. infraspecifics or species) stored in hierarchies available through that page. If any of the returned names was an exact match or very similar (i.e. levenshtein distance < 3) it was retained as alternative. In case more than one possible alternative was found the name status was set to “ambiguous” unless the closest match was a perfect match (see discussion for details). If a single perfect match or alternative match was found the corresponding status was used as result. For CoL and TPL the same approach was realized using the API or mySQL queries to retrieve and test respective data.

For subsequent analyses the results of the taxon name status evaluation were merged and a list of unique names was compiled (Unique Name List, Fig. 1). In addition to taxonomic status the source's unique identifier for the taxon was stored for later use (Unique Identifier List, Fig. 1).

To evaluate status discrepancies between the sources, the results from all three data providers were combined. The status of each unique name was compared between pairs of data sources (i.e. TPL vs. COL, TPL vs. EOL, COL vs. EOL). The status was either identical or not. In the latter case the comparison was saved and used to evaluate discrepancies in more detail.

Conservation - IUCN Red List status

The list of unique verified taxon names (UNL, Fig. 1) was used to determine the IUCN Red List status for each taxon. First, plant data was downloaded from the IUCN website and installed in a local database. Second, the database was queried and respective status information was saved.

Geographic distribution

To compile an overview of the geographic distribution of available genetic resources, I used two approaches. Firstly, I used the list of verified taxon names with unique identifiers from TPL, EOL and COL (UIL, Fig. 1) to retrieve distribution data using the respective data source. Secondly I used the list of unique verified taxon names (UNL, Fig. 1) to query a locally compiled flora database. In both instances the aim was to assess the distribution of the respective taxon according to the continental scheme of the International Working Group on Taxonomic Databases For Plant Sciences (TDWG) [Brummitt 2001] while tropical and temperate Asia were combined to Asia. For the first approach this meant to parse respective text content delivered by data providers for the occurrence of geographical terms listed in the World Geographical Scheme for Recording Plant Distributions by the TDWG (see Table 1 - 5, Brummitt 2001).

The Plant List Distribution Data

In case of TPL only some of the data sources offer information about distribution that I could use for this analysis: If the TPL data source was either the World Checklist of Selected Plant Families (WCSP), the International Organization for Plant Information (IOPI) or iPlants, the stored identifier (UIL) was used to access taxon information at WCSP which is based on TDWG scheme. In case the TPL data source was the International Legume Database & Information Service (ILDIS), I retrieved the taxon report from their website by using the stored identifier and parsed for geographical records with respective region names. If Tropicos was the TPL data source I used their web service to retrieve distribution data of the stored identifier. For TPL data provided by the The International Compositae Alliance (TICA), a locally installed database containing occurrence information of respective taxa was used (data was kindly provided by Kevin Richards).

The Catalouge of Life and EOL Distribution Data

The web services of COL and EOL could be accessed directly using stored identifiers (UIL).

Flora DB Approach

As a second approach and to supplement the geographical information retrieved from the previously mentioned sources, I compiled a flora database using information of different available floras and regional name checklists (Table 3). Processing the names stored in the list of unique taxa (UNL), each occurrence in a certain flora or regional checklist was used as indicator for the distribution of the respective taxon.

Table 3.

List of sources for the compilation of a flora database.

Region

Name

Reference / Date of retrieval

Africa

African Plant Database

African Plants Database (version 3.4.0). Conservatoire et Jardin botaniques de la Ville de Genève and South African National Biodiversity Institute, Pretoria, "Retrieved August 2014", from <http://www.ville-ge.ch/musinfo/bd/cjb/africa/>.

Catalogue of the Vascular Plants of Madagascar

Tropicos , Madagascar Catalogue, 2013. Catalogue of the Vascular Plants of Madagascar. Missouri Botanic Garden, St. Louis, U.S.A. & Antananarivo, Madagascar [http://www.efloras.org/madagascar. Accessed: April, 2014].

Asia

Flora of China

Tropicos, botanical information system at the Missouri Botanic Garden - www.tropicos.org, Mai 2014

Flora of Taiwan Checklist

eFloras, August 2014

Ornamental Plants From Russia And Adjacent States Of The Former Soviet Union

eFloras, August 2014

Annotated Checklist of the Flowering Plants of Nepal (eFloras)

Tropicos classic, Mai 2014

Australasia

Australian Plant Name Index (APNI)

April 2014

New Zealand indigenous plant list

New Zealand Plant Conservation Network, 2010

Europe

World Checklist of Selected Plant Families (WCSP)

WCSP (2014). 'World Checklist of Selected Plant Families. Facilitated by the Royal Botanic Gardens, Kew. Published on the Internet; http://apps.kew.org/wcsp/ Retrieved August 2014.'

North America

Flora of North America

Tropicos, botanical information system at the Missouri Botanic Garden - www.tropicos.org, Mai 2014

South America

Flora of Chile

Tropicos, botanical information system at the Missouri Botanic Garden - www.tropicos.org, ?

Flora Brasiliensis

Centro de Referência em Informação Ambiental, CRIA, September 2014

Cubensis prima flora

Biodiversity Heritage Library, Biblioteca Digital del Real Jardin Botanico de Madrid

Catalogue of the Vascular Plants of Ecuador

Tropicos, botanic information system at the Missouri Botanic Garden - www.tropicos.org, April 2014

Invasiveness - GISIN and DAISIE

To determine the number and names of potentially invasive taxa cultivated by botanic gardens, I used the Global Invasive Species Information Network (GISIN) web service (http://www.gisin.org/GISIN/GISINWebService) to query each taxon name (UNL, Fig. 1) and retrieve occurrence datasets. If the dataset contained an entry that reported the taxon as exotic and harmful it was considered to be potentially invasive. Additionally, I compiled a database containing all Magnoliophyta, Gymnospemae, Pteridophyta and Bryophyta taxon names of DAISIE. Again, all unique verified names (UNL) were checked for occurrence in the local DAISIE database.

Results

Seed lists of 135 botanic gardens located in 124 European, 6 North American, 4 Asian countries and 1 South American country (Fig. 2) were used to extract taxonomic names. I was able to extract 58'990 names, comprising 14'795 unique species, 209 hybrids, 1'147 subspecies, 683 varieties, 85 forms and 1'892 cultivars which are derived from 3'260 genera and 296 families. After excluding all cultivars there were 16'224 taxa left for validation via EOL, COL and TPL. The assembled TPL name database contained 1'292'889 entries, 99.9 % of Version 1.1 with 1'293'685 name records.

Taxon name verification

In the exact match pass, 13'764, 12'939 and 14'338 names could be verified using COL, EOL and TPL respectively, constituting a success rate of 79.8 to 88.4 % (Fig. 3a). Searching for similar names using a levenshtein distance of < 3 and using non-exact search resulted in additional 566, 1'894 and 686 verified names raising the success rate to 88.3 to 92.6 %. 1'894, 1'391 or 1'200 names (7.4 to 11.7 %) could not be verified using COL, EOL and TPL respectively (Fig. 3b). All verified unique and unambiguous names with their identifiers (UIL, Fig. 1) for each of the sources (COL: 13'454, TPL: 14'469, EOL: 13'960) were used to retrieve information on the geographical distribution of respective taxa. Combining verified names of all three online sources and removing redundant names resulted in 15'973 unique entries (98.5 % of the original 16'224) which were used to retrieve information on conservation status, geographical distribution and invasiveness (UNL, Fig. 1).

Figure 3.

Results of the taxonomic name verification and status check (Suppl. materials 2, 3) using COL, EOL and TPL.Suppl. materials 2, 3

aOverview on the taxonomic name status (accepted, synonym, unresolved, ambiguous and NA = not available) of 16'224 unique taxon names retrieved from 135 indices seminum determined by inquiry using three online sources (TPL, EOL and COL). 
bOverview on the recovery of taxonomic names and their status by using levenshtein distance < 3 and data from three online sources (TPL, EOL and COL). 

Taxon name status discrepancies

Comparing the status information retrieved during name verification from each of the sources 3'552 and 3'997 discrepancies were found between TPL and COL, and TPL and EOL respectively, while only 1'816 discrepancies were detected between COL and EOL (Fig. 4a). Among the 3'552 and 3'997 discrepancies between TPL and both, COL and EOL, the majority consisted of names accepted by COL and EOL but being considered ambiguous (2'048, 1'834), synonyms (612, 547) or unresolved (424, 566) by TPL (Fig. 4b). On the other hand names considered to be synonyms by COL and EOL were found to be accepted (532, 408), unresolved (222, 149) or ambiguous (167, 177) by TPL (Fig. 4c). Lastly a minority of ambiguous COL and EOL names were found to be accepted (95, 184), synonyms (45, 117) or unresolved (9, 15) by TPL (Fig. 4d).

Figure 4.

Taxon name status discrepancies between TPL, EOL and COL (Suppl. material 4).

aComparison of status discrepancy: Comparison of TPL with COL and EOL (green), COL with EOL and TPL (blue) and EOL with COL and TPL (red) are represented by a triangle (center = 0 point, two sides with a length determined by the number of discrepancies with the source denoted in the corner and one side connecting these two). 
bDiscrepancy details. Showing the TPL name status for names considered to be accepted by EOL (left bar) and COL (right bar) 
cDiscrepancy details. Showing the TPL name status for names considered to be synonyms by EOL (left bar) and COL (right bar) 
dDiscrepancy details. Showing the TPL name status for names considered to be ambiguous by EOL (left bar) and COL (right bar) 

Conservation - IUCN Red List status

Out of the 15'973 unique taxon names (UNL), 1'066 (7 %) were found to be assessed by the IUCN Red List. One of the taxon names returned the status extinct (EX). Euphrasia minima Jacq. ex DC., according to IUCN a synonym for Euphrasia mendoncae Samp., can be found in the seed list of BGU Lautaret. The same name with different authority (Euphrasia minima Schleich.) can be found in the seed list of CJB Geneva. Three of the taxon names lead to the status extinct in the wild (EW): Bromus bromoideusLysimachia minoricensis and Mangifera casturi. 267 (1.7 %) names fell into one of the IUCN Red List threatened categories (vulnerable, endangered and critically endangered) and the remaining 795 included 84 near threatened (27 in the lower risk category), 6 lower risk conservation dependent, 620 least concern (39 on the lower risk category) and 85 data deficient taxon names (Fig. 5). All taxon names and their IUCN Red List status are compiled in Suppl. material 5.

Figure 5.

IUCN Red List Status of 1'076 taxon names found in seed catalouges of 135 botanic gardens (Suppl. material 5). Status categories include Extinct: Extinct (EX) and Extinct in the wild (EW); Threatened: Vulnerable (VU), Endangered (EN) and critically Endangered (CR); Others and Lower Risk (LR): Near threatened (NT,nt), Least Concern (LC,lc), Conservation dependent (cd); and Data deficient (DD)

Geographic Distribtion

To retrieve information about the geographic distribution using a locally installed flora database, a list of unique names (UNL, Fig. 1) containing 15'973 entries was used. For the retrieval of respective information using COL, TPL and EOL, unique identifier lists (UIL, Fig. 1), containing 13'454 (COL), 14'469 (TPL) and 13'960 (EOL) entries, were used. About 62 % of the unique name list entries were detected in one or more of the floras (Table 3) and their geographic distribution could be approximated. Among the three external sources COL returned data on the distribution of 87 % of the names. Considerably less success was achieved using EOL (46 %) and TPL (36 %). Merging all information into one dataset slighty increased the overall success to 88 % (Fig. 6).

Figure 6.

Geographic distribution of studied taxa (Suppl. materials 6, 7). Number and ratio of taxon names connected to distribution data accessible via EOL, TPL and COL, occurring in a locally compiled flora database (FloraDB) and found by the one or the other approaches (Merged).

Most of the evaluated taxa occur in Eurasia (Fig. 7b, c) followed by Africa and the Americas (Figs 7d, 8a, bFigs 7c, d, 8a) and Australasia and the Pacific (Fig. 8c, d). The distribution after merging all information is displayed in Fig. 7a. Taxa exclusively distributed in one of the regions are also mostly from Eurasia and only little from the Americas and remaining regions.

Figure 7.

Geographic (Europe, Asia, Africa, North America, South America, Australasia and Pacific) distribution of plant genetic resources according to Flora, EOL, TPL and COL information (Suppl. materials 6, 7). Part I.

aTaxa inclusively (dark) and exclusively (light) distributed in major regions according to combined flora, COL, EOL and TPL data.  
bTaxa inclusively (dark) and exclusively (light) distributed in Europe by flora, COL, TPL and EOL data. 
cTaxa inclusively (dark) and exclusively (light) distributed in Asia by flora, COL, TPL and EOL data. 
dTaxa inclusively (dark) and exclusively (light) distributed in Africa by flora, COL, TPL and EOL data. 
Figure 8.

Geographic (Europe, Asia, Africa, North America, South America, Australasia and Pacific) distribution of plant genetic resources according to Flora, EOL, TPL and COL information (Suppl. materials 6, 7). Part II.Suppl. materials 6, 7

aTaxa inclusively (dark) and exclusively (light) distributed in North America by flora, COL, TPL and EOL data. 
bTaxa inclusively (dark) and exclusively (light) distributed in South America by flora, COL, TPL and EOL data. 
cTaxa inclusively (dark) and exclusively (light) distributed in Australasia by flora, COL, TPL and EOL data. 
dTaxa inclusively (dark) and exclusively (light) distributed in the Pacific by flora, COL, TPL and EOL data. 

COL appears as major contributor of distributional data in case of most regions while EOL is a major contributor primarily of Asian and American occurrences. The flora database makes similar contributions as COL (European, African and Australasian occurrences), EOL (Asian occurrences) and TPL (American occurrences).

Invasiveness - GISIN and DAISIE

Of the 15'973 unique taxon names (UNL) 3.9 % (616) have records in the GISIN database. 577 of these taxa have been reported as exotic, 183 also as harmful (invasive) and 39 neither as exotic nor as harmful. All exotic and ivasive taxa are contained in Suppl. material 8. DAISIE offers information on 6'091 European alien plant taxa. Evaluating the UNL 19.6 % of the names (> 50 % of the DAISIE taxa) appear to be European alien taxa. DAISIE also offers a list of the 100 worst invasive aliens in Europe. Among these there are 18 terrestial plant taxa and 12 of these are found in the UNL. All detected DAISIE taxon names are contained in Suppl. material 9.

Miscellaneous

Of 15'973 unique taxon names 51 % (8'489) can be found in the BOLD taxonomy database. 7'691 of these taxon names also have one or more public barcode record registered with BOLD.

Discussion

Species, the units of biodiversity, have been described, named, grouped and rearranged for centuries. Scientific research has been recorded under one or another name depending on current opinion. Todays names have a history that comprises valuable hints for constructing a web of knowledge for public and scientific purposes. Names are fundamental for the correct determination of identity and for subsequent research that is based on the premise that data is obtained from a specific taxon.

Genetic Resources of botanic gardens

Botanic gardens cultivate and store many different plant species for scientific research and education. They should maintain a high degree of consistency with current systematic research and should have adequately documented specimens of correct species identity. Factors that most likely compromise these points are insufficient financial commitment and the decline of expertise in classical botany due to reduction or omission of respective topics and training in the biological curriculum.

An approach aiming to resolve this issue, promising specimen identification on the species level using a small standardized region of the (plant) genome [DNA barcoding sensu Hebert et al. 2003], so far, has been shown to be very limited in the plant kingdom [Taberlet et al. 2007, Seberg and Petersen 2009, Roy et al. 2010]. With their particular life style and high degree of flexibility plants have been regularly shown not to apply to the conventional biological species concept (sensu Mayr) causing considerable difficulties for any authentication method. The facts that for only 48 % of the analysed taxa at least one DNA barcode sequence exists and the offical plant barcoding markers [CBOL 2009] with considerable limitations in species level identification [Federici et al. 2013] still represent a significant part of those, are pressing reasons to either complement better suited sequence markers or to start looking for a different practical solution. The development of sequence markers based on amplified DNA fingerprinting fragments is still too laborious to be practical. Using highly species specific functional DNA fingerprinting patterns [Bardini et al. 2004, Breviario et al. 2007, Poczai et al. 2013] obtained by fast and reliable methods [Gavazzi et al. 2012], however, might be a feasible alternative.

Botanic gardens are in their nature places where many different species are brought together that under normal circumstances would never meet (artificial sympatry). Additionally their cultivation inevitably means that they are put into a novel environment. Both factors entail complex consequences that are relevant for our understanding of plant evolution and for conservation biological projects [Ashton 1988, Husband and Campbell 2004, Maunder et al. 2004, Schaal and Leverich 2004].

How can botanic gardens maintain a high degree of consistency with current systematic research and provide authentic genetic resources ? With internet access a collection can easily be connected to all available data and thus provide local staff and scientists alike with information of their particular interest (e.g. information on natural habitat, taxonomy, invasiveness, conservation status, all known common and scientific names, traditional usage, medicinal potential, etc). As demonstrated in this study in most cases the different sources are in agreement on taxonomic names but there is also a significant number of conflicting cases. Creating brought awareness of these cases increases the chance of research that, ultimately, will resolve conflicts. In this regard it is favorable to integrate taxonomic name status information that clearly indicates if a name should be used with caution (e.g. the status "Unresolved" of TPL) that at the same time indicates the need for further study.

Judging from the analysed seed lists there are still many botanic collections with specimens of mostly unknown origin. This fact alone reduces scientific value considerably [Böcher and Hjerting 1964]. Effects of cultivation in an artificial environment (selection, hybridization) using a small sample of the genetic structure (genetic bottleneck) of a species [Barrett et al. 1991, Maunder et al. 2004] cannot be traced and certain key aspects would have to be studied to ascertain, that the individual is still representative for the respective species. For that a new adequate [Brown and Briggs 1991] sample would be necessary that at the same time would render the former sample obsolete. To increase the scientific value of botanic collections accessions with unknown origin should if possible be replaced with adequately documented ones. Having worked with several plant genera obtained from different botanic gardens misidentifications or mix-ups appear to be as common as proposed in other publications [Hurka 1994] and pose another problem for education and research.

The namespace of EOL, COL and TPL

Results of taxon name verification and status check are based on data provided by third parties to one or more of the data portals used in this study. In case of EOL the name status is determined by EOL curators. A name as part of one or more hierarchies is either the preferred name or it is not. Only COL and TPL provide a name with a status that is based on the respective data source. According to TPL 98 % of all status values were directly derived from the data source that supplied the name record while only 2 % are a result of automated conflict resolution processes. Information provided by COL is based on global species databases that "have been validated for inclusion by independent peer reviews". How many peers are necessary to validate databases that contain names of thousands of plant species from hundreds of families? Is it the few or the many peers that bring about an objective assessment of taxonomy ?

The coverage of taxon names is highest (92 %) using the plant specific portal (TPL) closely followed by that of EOL (91 %) and COL (88 %). Using the levenshtein distance as recovery approach is adequate for simple typos. In case of more complex spelling variants of a name the fuzzy algorithm [Rees 2014] would be more appropriate. Most of the 11.6 % of names verified by EOL in the second round of the name analysis are not spelling variants but names that were not found due to the used exact search option. This can be explained by an inconsistent name field format. It does not only contain the taxonomic name, like in the case of COL and TPL, but in the mentioned instances also includes the authority of the name. The similarity of results between COL and EOL, particularly the lower level of status discrepancies, is based on the fact that EOL is using COL as its taxonomic backbone since its launch in 2008.

Considering name statuses TPL apparently offers a more heterogeneous view. The status "Unresolved" which in most cases indicates that the respective name status has not yet been assessed by the data provider is unique and makes TPL a distinguished source for plant names. In addition, a much higher degree of name ambiguity can be detected using TPL. About 4 - 6 times more names are considered ambiguous because of the existence of homonyms, many of which are not found using COL and EOL. The primary concern of this study was to find names and associated information. Having an exhaustive name space increases the number of hits when mining for data. This, however, could also result in a more contaminated dataset. Having complete information on a name, never the less, means being able to take all information into consideration when evaluating associated data.

Conserving names or species ?

It is curious to find a presumably extinct species (Euphrasia mendoncae Samp.) in a seed list of a botanic garden. The species was described as Portuguese endemic in 1936 and was never found again. IUCN therefore classed the species as extinct. Along with this information doubt on the identity of the portuguese taxon is expressed in the IUCN dataset and Euphrasia minima Jacq. ex Dc. is introduced as synonym of E. mendoncae. The presumed synonym E. minima Jacq. ex DC. appears in the IS of BGU Lautaret 2014, reporting the collection site at 2500 m height in Col du Galibier, France. While COL lists E. mendoncae Samp. as accepted name, TPL lists it as Unresolved, supporting the notion of insufficient data on the identity of the species described in 1936. While E. minima Jacq. ex DC. is not listet in TPL the name appeared as a synonym of E. officinalis subsp. officinalis L. in COL (2014) and in the meantime (2015) is classed as doubtful synonym of E. officinalis subsp. officinalis Jacq. ex DC.. This example demonstrates that names, as important as they are, sometimes are too ambiguous to be meaningful.

1.7 % of the taxon names surveyed fell into a IUCN threatened category. This proportion appears to be very low and might suggest failure of conservation initiatives that aimed to draw botanic gardens closer into the conservation network (e.g. IUCN / WWF Plants Conservation Program, 1984). However, conservation efforts are not expressed by the quantity of threatened taxa maintained by a botanic garden alone (For performance indicators for ex situ conservation facilities see Havens et al. 2006). Conservation efforts start with promoting awareness [Williams et al. 2015]. Awareness that human civilization does have a major impact on ecosystems which sustain, amongst ourselves, many different interconnected species. The destruction of these ecosystems not only could mean the loss of those species and eventually endanger ourselves, but could result in a loss of diversity that one day may have lead to the discovery of a cure for a disease, a future crop that has a natural tolerance or an exceptional nutritional value. Public opinion, education and awareness are key aspects for the establishment of an evolutionary ethic as part of our societies which would lead to a broad public recognition that the existence of other species is an integral part of our own [Hurka 1994, Frankel 1974].

Reasons for a low proportion of threatened taxa distributed by botanic gardens might be that the distribution itself is considered to be counter productive by leading to extreme genetic depletion (i.e. gardens prefer to order seeds instead of collecting them from natural habitats). Another point may be the necessary expertise that only through considerable commitment of time and resources can be established. Additionally due to a lack of sufficient documentation and prior knowledge of conservational aspects [Hurka 1994], present collections of botanic gardens are for the most part still meaningless for conservation purpose. Finally the fact that about 4600 (42 %) of European threatened taxa have been detected in European botanic garden collections [Sharrock and Jones 2011] but not in my study may indicate that most of these taxa are not distributed as seeds or most of the gardens holding the taxa are not included in my sample.

Geographic distribution

Depending on the source the distribution of an taxon can be determined on the continental, country or even regional level. Unfortunately the form of geographic information is not all consistent. Besides the TDGW scheme [Brummitt 2001] there is also simple text that if not combined with additional markers results in ambiguity (e.g. Georgia is either a state of the USA or a country located between the Black and Caspian sea). Another issue is the undetermined completeness of the information. Does a source contain complete information on the geographical distribution of a taxa or is it limited? There are no indicators to answer that question. Much detail of distribution is still hidden. Datasets gathered on the regional level (geobotanic mappings) or civil science reports on occurrences could be integrated into respective repositories (e.g. GBIF). Studies on distribution of certain taxa including community patterns will be possible as soon as coverage is sufficiently high and data is freely accessible through one of the sources mentioned earlier. Similarly, EOL's TraitBank [Parr et al. 2014] - a new text mining approach inferring traits (including distribution, habitat, elevation, interactions, etc.) from the occurrence of specific patterns (Ontology of Biological Attributes, Dietze et al. 2014) in content of the respective species enriches the web of species knowledge. While in the past one had to search and filter information in the form of books manually, consuming a lot of time and patience, today, thanks to the efforts of EOL and others that prepare and provide data on a freely accessible platform, we can answer questions more quickly and, more importantly, approach questions differently [Hart 2014, Wright and Seltmann 2014, Barve 2014].

Invasive Information

On the 1st of January 2015 an EU regulation on the prevention and management of the introduction and spread of invasive alien species (No. 1143/2014) came into force. It aims to address the adverse impact alien invasive species have on biodiversity, ecosystem services, human health and the economy in the EU Member States. Botanic gardens, without doubt, create artificial situations for species. Hybridization, as one possible consequence, has been shown to be an important factor in evolution [Whitham et al. 1994, Martinsen and Whitham 1994] and as promotor to invasiveness [Ellstrand and Schierenbeck 2000]. For the European Botanic Garden Consortium (EBGC) "it is vital that Botanic Gardens take steps to prevent future problem taxa from establishing through their collections" and direct stakeholders to "Initiatives such as Delivering Alien Invasive Species Inventories for Europe and North European and Baltic Network on Invasive Species as well as National Initiatives, such as Harmonia - Invasive species in Belgium" which provide detailed databases on respective species. One major problem mentioned is, that "it can be difficult for garden managers and curators to obtain summary lists that provide at a glance indications of problem taxa". This difficulty subsequently leads to seed lists containing potentially invasive species (e.g. Acacia dealbata Link , Ailanthus altissima (Mill.) Swingle, Echinocystis lobata (Michx.) Torr. & Gray., Rhododendron ponticum L.) without any note or indication. The identification of high-risk taxa is, however, the responsibility of those who introduce exotic species - may that be botanic gardens or the horticultural industry [Wittenberg and Cock 2004, Baskin 2002]. I demonstrated the integration of data from publicly available sources (GISIN and DAISIE) to provide at a glance indications of potentially invasive taxa. It would be possible to establish a publicly available web-tool that takes a simple list of taxa and in return produces a list of the supplied taxa and their invasive potential as demonstrated in this study (Suppl. material 8 and Suppl. material 9). Even more convenient would be if collection databases contained dynamic information provided by respective sources. Garden managers and scientists would be able to act with little latency evaluating and eliminating potential threats [Hulme 2014, Hulme 2011, Sharrock 2011Hulme 2011, Hulme 2014]. For this to become possible respective sources need to establish an API and collection managers or stakeholders should discuss the creation of an adminsitrative solution suitable for botanic gardens. Since many data providers already offer an API the most likely obstacle in implementing such a tool would be the transition of collection data from the "old" to the "new" system. In any case the "light" version offering a list matching service similar to that offered by COL for name statuses (http://www.catalogueoflife.org/content/list-matching-service) would still be an option and represent progress.

Conclusion

Botanic gardens represent one major sources of plant genetic resources. As to be useful for education or any other scientific research the material needs to be authentic and properly documented. Particularly in a highly specialized scientific world, where experience in the determination of species has become rare other methods need to be considered. The true (i.e. accepted) name of a taxon is essential in many ways. To verify the identity of a specimen the true name will lead to a diagnostic description. With a verified specimen studies can commence and DNA based authentication assays could be established. True names lead to alternative names that lead to additional information. To assess the conservational, horticultural or medicinal value of a collection true names lead to respective data that can be evaluated. To measure progress towards a certain goal only true names are relevant. In this study I evaluated names of taxa offered in seed catalogues demonstrating that there are still significant numbers of taxonomic issues which need to be resolved before true names will be available. Furthermore I demonstrated that The Plant List Version 1.1 is currently the most complete and informative checklist for plant species names. Providers of critical data (e.g. CITES, IUCN Red List, invasive or toxic species catalogues) should make sure to include a complete list of known names for a particular taxon to maximize data exposure.

With increasing level of publicly available data through portals like EOL and publishers supporting open data sharing [e.g. Smith et al. 2013, Kenall et al. 2014], a dynamic integration of this data will ultimately revolutionize the way botanic gardens promote their agendas and ecological and evolutionary science is approached.

Acknowledgements

Thanks goes to all mentioned people involved in the creation and maintenance of the used data portals as well as those standing behind the actual data providers, to all the botanic gardens as listed in Suppl. material 1 for providing the seed lists and finally to Prof. Peter Nick for supporting my work.

References

  • Aplin D, Heywood V (2008)
    Do Seed Lists have a future?
    Taxon
    57
    (
    3
    ):
    709
    711
    . https://doi.org/10.2307/27756702
  • Aplin D, Linington S, Rammeloo J (2007)
    Indices Seminum: Are They Really Worth The Effort?
    Sibbaldia Horticultural Journal
    (
    5
    )
    93
    108
    .
  • Ashton PS (1988) Conservation of Biological Diversity in Botanical Gardens.
    Biodiversity
    .
    9
    pp. [ISBN
    0-309-56736-X
    ].
  • Banbury B, O'Meara B (2014)
    Reol: R interface to the encyclopedia of life
    .
    4
    :
    2577
    2583
    . https://doi.org/10.1002/ece3.1109
  • Bardini M, Lee D, Donini P, Mariani A, Gianì S, Toschi M, Lowe C, Breviario D (2004)
    Tubulin-based polymorphism (TBP): a new tool, based on functionally relevant sequences, to assess genetic diversity in plant species.
    National Research Council Canada
    47
    :
    281
    291
    . https://doi.org/10.1139/g03-132
  • Barrett SCH, Kohn JR, Falk DA, Holsinger KE, o (1991) Genetic and evolutionary consequences of small population size in plants: implications for conservation.
    Genetics and conservation of rare plants.
    Oxford University Press
    ,
    27
    pp. [ISBN
    0-19-506429-1
    ].
  • Barve V (2014)
    Discovering and developing primary biodiversity data from social networking sites: A novel approach
    .
    24
    :
    194
    199
    . https://doi.org/10.1016/j.ecoinf.2014.08.008
  • Baskin Y (2002)
    The Greening of Horticulture: New Codes of Conduct Aim to Curb Plant Invasions
    .
    BioScience
    52
    (
    6
    ):
    464
    471
    . https://doi.org/10.1641/0006-3568(2002)052[0464:TGOHNC]2.0.CO;2
  • Böcher T, Hjerting JP (1964)
    Utilization of Seeds from Botanical Gardens in Biosystematic Studies
    .
    Taxon
    13
    (
    3
    ):
    96
    98
    . URL: http://www.jstor.org/stable/1216625
  • Breviario D, Baird WV, Sangoi S, Hilu K, Blumetti P, Gianì S (2007)
    High polymorphism and resolution in targeted fingerprinting with combined β-tubulin introns
    .
    Molecular Breeding
    20
    :
    249
    259
    . https://doi.org/10.1007/s11032-007-9087-9
  • Brown AHD, Briggs JD (1991) Sampling strategies for genetic variation in ex situ collections of endangered plant species.
    Genetics and conservation of rare plants
    .
    Oxford University Press
    ,
    20
    pp. [ISBN
    0-19-506429-1
    ].
  • Brummitt RK (2001)
    World Geographical Scheme for Recording Plant Distributions Edition 2
    .
    153
    pp. [ISBN
    091319672X
    ]
  • CBD S (2001)
    Assessment and management of alien species that threaten ecosystems, habitats and species. Abstracts of keynote addresses and posters presented at the sixth meeting of the Subsidiary Body on Scientific, Technical and Technological Advice, held in Montreal.
    CBD Technical Paper
    1
    :
    1
    135
    . URL: https://www.cbd.int/doc/publications/cbd-ts-01.pdf
  • CBOL PWG (2009)
    A DNA barcode for land plants
    .
    Proceedings of the National Academy of Sciences of the United States of America
    106
    (
    31
    ):
    12794
    12797
    . URL: http://www.pnas.org/content/106/31/12794.full
  • Desmond R (1995)
    Kew: the history of the Royal Botanic Gardens
    .
    Harvill Press
    ,
    466
    pp. [ISBN
    9781860465291
    ]
  • Dietze H, Berardini TZ, Foulger RE, Hill DP, Lomax J, Osumi-sutherland D, Roncaglia P, Mungall CJ (2014)
    TermGenie – a web-application for pattern-based ontology class generation
    .
    Journal of Biomedical Semantics
    5
    (
    48
    ):
    1
    13
    . URL: http://www.jbiomedsem.com/content/5/1/48
  • Donaldson J (2009)
    Botanic gardens science for conservation and global change
    .
    Trends in Plant Science
    14
    :
    608
    613
    . https://doi.org/10.1016/j.tplants.2009.08.008
  • DP G (2009)
    Towards a management hierarchy (classification) for the Catalogue of Life: Draft Discussion Document.
    Species 2000 & ITIS Catalogue of Life: 2009 Annual Checklist
    .
    Species 2000: Reading, UK
    ,
    1
    pp. URL: http://www.catalogueoflife.org/col/info/hierarchy
  • Ellstrand NC, Schierenbeck KA (2000)
    Hybridization as a stimulus for the evolution of invasiveness in plants?
    Proceedings of the National Academy of Sciences
    97
    (
    13
    ):
    7043
    7050
    . https://doi.org/doi: 10.1073/pnas.97.13.7043
  • Federici S, Galimberti A, Bartolucci F, Bruni I, De mattia F, Cortis P, Labra M (2013)
    DNA barcoding to analyse taxonomically complex groups in plants: The case of Thymus (Lamiaceae)
    .
    Botanical Journal of the Linnean Society
    171
    :
    687
    699
    . https://doi.org/10.1111/boj.12034
  • Frankel OH (1974)
    Genetic conservation: our evolutionary responsibility
    .
    Genetics
    78
    (
    1
    ):
    53
    65
    . URL: http://www.genetics.org/content/78/1/53
  • Gavazzi F, Casazza AP, Depedro C, Mastromauro F, Breviario D (2012)
    Technical improvement of the TBP (tubulin-based polymorphism) method for plant species detection, based on capillary electrophoresis
    .
    Electrophoresis
    33
    :
    2840
    2851
    . https://doi.org/10.1002/elps.201200144
  • Hart EM (2014)
    Using Encyclopedia of Life’s TraitBank to identify plant traits associated with vulnerability
    .
    PeerJ PrePrints 2:e491v1
    0
    :
    0
    . URL: https://peerj.com/preprints/491/
  • Havens K, Vitt P, Maunder M, Guerrant E, Dixon K (2006)
    Ex Situ Plant Conservation and Beyond
    .
    BioScience
    56
    (
    6
    ):
    525
    . https://doi.org/10.1641/0006-3568(2006)56[525:ESPCAB]2.0.CO;2
  • Hebert PDN, Cywinska A, Ball SL, DeWaard JR (2003)
    Biological identifications through DNA barcodes.
    Proceedings. Biological sciences / The Royal Society
    270
    :
    313
    321
    . https://doi.org/10.1098/rspb.2002.2218
  • Heywood VH (1964)
    Some Aspects of Seed Lists and Taxonomy
    .
    Taxon
    13
    (
    3
    ):
    94
    95
    . URL: http://www.jstor.org/stable/1216624
  • Howard R, Green P, Baker H, Yeo P (1964)
    Comments on " Seed Lists "
    .
    Taxon
    13
    (
    3
    ):
    90
    94
    . URL: http://www.jstor.org/stable/1216623
  • Hulme P (2014)
    Resolving whether botanic gardens are on the road to conservation or a pathway for plant invasions
    .
    Conservation Biology
    29
    (
    3
    ):
    816
    824
    . https://doi.org/10.1111/cobi.12426
  • Hulme PE (2011)
    Addressing the threat to biodiversity from botanic gardens
    .
    Trends in ecology & evolution
    26
    (
    4
    ):
    168
    174
    . URL: http://www.sciencedirect.com/science/article/pii/S0169534711000206
  • Hurka H (1994) Conservation genetics and the role of botanical gardens.
    Conservation Genetics
    .
    68
    .
    Birkhäuser Basel
    ,
    9
    pp. https://doi.org/10.1007/978-3-0348-8510-2_29
  • Husband BC, Campbell LG (2004) Population responses to novel environmnents: implications for ex situ plant conservation.
    Ex Situ Plant Conservation: Supporting Species Survival in the Wild
    .
    35
    pp. [ISBN
    9781559638753
    ].
  • Kenall A, Harold S, Foote C (2014)
    An open future for ecological and evolutionary data?
    BMC ecology
    14
    (
    1
    ):
    10
    . https://doi.org/10.1186/1472-6785-14-10
  • Linné Cv, Salvius L (1753)
    Caroli Linnaei Species plantarum
    .
    vol. 1
    .
    Impensis Laurentii Salvii
    ,
    560
    pp. URL: http://www.biodiversitylibrary.org/item/13829
  • Martinsen GD, Whitham TG (1994)
    More birds nest in hybrid cottonwood trees
    .
    The Wilson Bulletin
    106
    (
    3
    ):
    474
    481
    . URL: http://www.jstor.org/stable/4163447
  • Maunder M, Higgens S, Culham A (2001)
    The effectiveness of botanic garden collections in supporting plant conservation: A European case study
    .
    Biodiversity and Conservation
    10
    (
    3
    ):
    383
    401
    . https://doi.org/10.1023/A:1016666526878
  • Maunder M, Hughes C, Hawkins JA, Culham A (2004) Hybridization in Ex Situ Plant Collections: Conservation Concerns, Liabilities, and Opportunities.
    Ex Situ Plant Conservation: Supporting Species Survival in the Wild
    .
    3
    .
    Island Press
    ,
    39
    pp. [ISBN
    9781559638753
    ].
  • Palmer R (1985)
    Medical botany in northern Italy in the Renaissance.
    Journal of the Royal Society of Medicine
    78
    (
    2
    ):
    149
    157
    . URL: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1289588/
  • Parr CS, Wilson N, Leary P, Schulz KS, Lans K, Walley L, Hammock Ja, Goddard A, Rice J, Studer M, Holmes JTG, Corrigan RJ (2014)
    The Encyclopedia of Life v2: Providing Global Access to Knowledge About Life on Earth.
    e1079
    . https://doi.org/10.3897/BDJ.2.e1079
  • Pimentel D, Lach L, Zuniga R, Morrison D (2000)
    Environmental and Economic Costs of Nonindigenous Species in the United States
    .
    BioScience
    50
    (
    1
    ):
    53
    65
    . https://doi.org/10.1641/0006-3568(2000)050[0053:EAECON]2.3.CO;2
  • Poczai P, Varga I, Laos M, Cseh A, Bell N, Valkonen J, Hyvonen J (2013)
    Advances in plant gene-targeted and functional markers: a review
    .
    Plant Methods
    9
    (
    1
    ):
    6
    . https://doi.org/10.1186/1746-4811-9-6
  • Rees T (2014)
    Taxamatch, an Algorithm for Near (‘Fuzzy’) Matching of Scientific Names in Taxonomic Databases
    .
    PLoS ONE
    9
    (
    9
    ):
    e107510
    . https://doi.org/10.1371/journal.pone.0107510
  • Roskov Y, Abucay L, Orrell T, Nicolson D, Kunze T, Culham A, Bailly N, Kirk P, Bourgoin T, DeWalt RE, Decock W, De Wever A (2014)
    Annual Checklist
    .
    Species 2000 & ITIS Catalogue of Life
    0
    :
    0
    . URL: http://www.catalogueoflife.org/annual-checklist/2014/
  • Roy S, Tyagi A, Shukla V, Kumar A, Singh U, Chaudhary LB, Datt B, Bag S, Singh P, Nair N, Husain T, Tuli R (2010)
    Universal plant DNA barcode loci may not work in complex groups: A case study with Indian berberis species
    .
    PLoS ONE
    5
    (
    10
    ). https://doi.org/10.1371/journal.pone.0013674
  • Schaal B, Leverich WJ (2004) Population Genetic Issues in Ex Situ Plant Conservation.
    Ex situ plant conservation: supporting species survival in the wild
    .
    3
    .
    Island Press
    ,
    18
    pp. [ISBN
    9781559638753
    ].
  • Seberg O, Petersen G (2009)
    How many loci does it take to DNA barcode a crocus?
    PLoS ONE
    4
    (
    2
    ):
    2
    7
    . https://doi.org/10.1371/journal.pone.0004598
  • Sharrock S, Jones M (2011)
    Saving Europe’s threatened flora: progress towards GSPC Target 8 in Europe
    .
    Biodiversity and conservation
    20
    (
    2
    ):
    325
    333
    . URL: http://link.springer.com/article/10.1007%2Fs10531-010-9912-z
  • Sharrock SL (2011)
    The biodiversity benefits of botanic gardens
    .
    Trends in Ecology and Evolution
    26
    (
    9
    ):
    433
    . URL: http://www.sciencedirect.com/science/article/pii/S0169534711001418
  • Smith V, Georgiev T, Stoev P, Biserkov J, Miller J, Livermore L, Baker E, Mietchen D, Couvreur T, Mueller G, Dikow T, Helgen KM, Frank J, Agosti D, Roberts D, Penev L (2013)
    Beyond dead trees: integrating the scientific process in the Biodiversity Data Journal
    .
    Biodiversity Data Journal
    1
    :
    e995
    . https://doi.org/10.3897/BDJ.1.e995
  • Taberlet P, Coissac E, Pompanon F, Gielly L, Miquel C, Valentini A, Vermat T, Corthier G, Brochmann C, Willerslev E (2007)
    Power and limitations of the chloroplast trnL (UAA) intron for plant DNA barcoding
    .
    Nucleic Acids Research
    35
    (
    3
    ). https://doi.org/10.1093/nar/gkl938
  • Waylen K (2006)
    Botanic Gardens: Using biodiversity to improve human well-being
    .
    IUCN: Medicinal Plant Conservation
    12
    :
    4
    8
    . URL: https://cmsdata.iucn.org/downloads/mpc12.pdf
  • Whitham TG, Morrow PA, Potts BM (1994)
    Plant hybrid zones as centers of biodiversity: the herbivore community of two endemic Tasmanian eucalypts
    .
    Oecologia
    97
    (
    4
    ):
    481
    490
    . URL: http://www.jstor.org/stable/4220647
  • Williams SJ, Jones JPG, Gibbons JM, Clubbe C (2015)
    Botanic gardens can positively influence visitors’ environmental attitudes
    .
    Biodiversity and Conservation
    24
    (
    7
    ):
    1
    12
    . URL: http://link.springer.com/article/10.1007%2Fs10531-015-0879-7
  • Wilson EO (2003)
    The encyclopedia of life
    .
    18
    :
    77
    80
    . https://doi.org/10.1016/S0169-5347(02)00040-X
  • Wittenberg R, Cock JWM (2004)
    Invasive Alien Species: A Toolkit of Best Prevention and Management Practices.
    xvii - 228
    pp. [ISBN
    0851995691
    ]
  • Wright C, Seltmann K (2014)
    Usage patterns of blue flower color representation by Encyclopedia of Life content providers
    .
    2
    :
    e1143
    . https://doi.org/10.3897/BDJ.2.e1143

Supplementary materials

Suppl. material 1: List of botanic gardens
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing all botanic gardens included in this study

Suppl. material 2: Results of the exact taxonomic name check
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing name status information of 16'224 taxonomic names retrieved by exact match inquiry from EOL, TPL and COL.

Suppl. material 3: Results of the non-exact taxon name check
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing name status information of names that were not found at EOL, TPL and COL using exact name search.

Suppl. material 4: Discrepancies of taxonomic name status between providers
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing discrepancies of pair-wise comparison of taxonomic name status information provided by EOL, TPL and COL

Suppl. material 5: List of taxa with IUCN Red List status information
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing taxon names with IUCN Red List status of taxa detected in seed catalogues of botanic gardens

Suppl. material 6: Results of the distribution inquiry using EOL, TPL and COL
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing information on geographic distribution of plant gentic resources distributed by botanic gardens. Retrieved from EOL, TPL and COL.

Suppl. material 7: Results of the distribution inquiry using EOL, TPL and COL (merged)
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing information on geographic distribution of plant gentic resources distributed by botanic gardens. Merged data retrieved from EOL, TPL and COL.

Suppl. material 8: List of taxa with GISIN status
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing taxon names and GISIN status information of exotic and invasive (exotic and harmful) taxa detected in seed catalogues of botanic gardens

Suppl. material 9: List of European alien taxa
Authors:  Thomas Horn
Data type:  table
Brief description: 

Table containing taxon names of European alien species detected in seed catalogues of botanic gardens