33 research outputs found

    CoordinateCleaner: Standardized cleaning of occurrence records from biological collection databases

    © 2019 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society. Species occurrence records from online databases are an indispensable resource in ecological, biogeographical and palaeontological research. However, issues with data quality, especially incorrect geo-referencing or dating, can diminish their usefulness. Manual cleaning is time-consuming, error prone, difficult to reproduce and limited to known geographical areas and taxonomic groups, making it impractical for datasets with thousands or millions of records. Here, we present CoordinateCleaner, an R package to scan datasets of species occurrence records for geo-referencing and dating imprecisions and data entry errors in a standardized and reproducible way. CoordinateCleaner is tailored to problems common in biological and palaeontological databases and can handle datasets with millions of records. The software includes (a) functions to flag potentially problematic coordinate records based on geographical gazetteers, (b) a global database of 9,691 geo-referenced biodiversity institutions to identify records that are likely from horticulture or captivity, (c) novel algorithms to identify datasets with rasterized data, conversion errors and strong decimal rounding and (d) spatio-temporal tests for fossils. We describe the individual functions available in CoordinateCleaner and demonstrate them on more than 90 million occurrences of flowering plants from the Global Biodiversity Information Facility (GBIF) and 19,000 fossil occurrences from the Palaeobiology Database (PBDB). We find that in GBIF more than 3.4 million records (3.7%) are potentially problematic and that 179 of the tested contributing datasets (18.5%) might be biased by rasterized coordinates. In PBDB, 1,205 records (6.3%) are potentially problematic. All cleaning functions and the biodiversity institution database are open-source and available within the CoordinateCleaner R package.

    The curse of the uncultured fungus

    The international DNA sequence databases abound in fungal sequences not annotated beyond the kingdom level, typically bearing names such as “uncultured fungus”. These sequences beget low-resolution mycological results and invite further deposition of similarly poorly annotated entries. What do these sequences represent? This study uses a 767,918-sequence corpus of public full-length fungal ITS sequences to estimate what proportion of the 95,055 “uncultured fungus” sequences represent truly unidentifiable fungal taxa, and what proportion would have been straightforward to annotate to some more meaningful taxonomic level at the time of sequence deposition. Our results suggest that more than 70% of these sequences would have been trivial to identify to at least the order/family level at the time of sequence deposition, hinting that factors other than poor availability of relevant reference sequences explain the low-resolution names. We speculate that researchers’ perceived lack of time and lack of insight into the ramifications of this problem are the main explanations for the low-resolution names. We were surprised to find that more than a fifth of these sequences seem to have been deposited by mycologists rather than researchers unfamiliar with the consequences of poorly annotated fungal sequences in molecular repositories. The proportion of these needlessly poorly annotated sequences does not decline over time, suggesting that this problem must not be left unchecked.

    Decreased soil moisture due to warming drives phylogenetic diversity and community transitions in the tundra

    Global warming leads to drastic changes in the diversity and structure of Arctic plant communities. Studies of functional diversity within the Arctic tundra biome have improved our understanding of plant responses to warming. However, these studies still show substantial unexplained variation in diversity responses. Complementary to functional diversity, phylogenetic diversity has been useful in climate change studies, but has so far been understudied in the Arctic. Here, we use a 25-year warming experiment to disentangle community responses in Arctic plant phylogenetic β diversity across a soil moisture gradient. We found that responses varied over the soil moisture gradient, where meadow communities with intermediate to high soil moisture had a higher magnitude of response. Warming had a negative effect on soil moisture levels in all meadow communities; however, meadows with intermediate moisture levels were more sensitive. In these communities, soil moisture loss was associated with earlier snowmelt, resulting in community turnover towards a more heath-like community. This process of 'heathification' in the intermediate moisture meadows was driven by the expansion of ericoid and Betula shrubs. In contrast, under a more consistent water supply Salix shrub abundance increased in wet meadows. Due to its lower stature, palatability and decomposability, the increase in heath relative to meadow vegetation can have several large-scale effects on the local food web as well as climate. Our study highlights the importance of the hydrological cycle as a driver of vegetation turnover in response to Arctic climate change. The observed patterns in phylogenetic β diversity were often driven by contrasting responses of species of the same functional growth form, and could thus provide important complementary information. Thus, phylogenetic diversity is an important tool in disentangling tundra response to environmental change.
    This study was supported by The Swedish Research Council FORMAS (No. 942-2015-1382 to RGB and 2016-01187 to MPB), The Swedish Research Council (No. 621-2014-5315 to RGB and No. 2015-04857 to AA), the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement (No: 657627 to MPB), BECC (Biodiversity and Ecosystem services in a Changing Climate), the Swedish Foundation for Strategic Research (AA), the Royal Botanic Gardens, Kew (AA), Qatar Petroleum (JMA), and Carl Tryggers Stiftelse för Vetenskaplig Forskning (JMA and MPB).

    Toward a Self-Updating Platform for Estimating Rates of Speciation and Migration, Ages, and Relationships of Taxa.

    Rapidly growing biological data, including molecular sequences and fossils, hold an unprecedented potential to reveal how evolutionary processes generate and maintain biodiversity. However, researchers often have to develop their own idiosyncratic workflows to integrate and analyze these data for reconstructing time-calibrated phylogenies. In addition, divergence times estimated under different methods and assumptions, and based on data of various quality and reliability, should not be combined without proper correction. Here we introduce a modular framework termed SUPERSMART (Self-Updating Platform for Estimating Rates of Speciation and Migration, Ages, and Relationships of Taxa), and provide a proof of concept for dealing with the moving targets of evolutionary and biogeographical research. This framework assembles comprehensive data sets of molecular and fossil data for any taxa and infers dated phylogenies using robust species tree methods, also allowing for the inclusion of genomic data produced through next-generation sequencing techniques. We exemplify the application of our method by presenting phylogenetic and dating analyses for the mammal order Primates and for the plant family Arecaceae (palms). We believe that this framework will provide a valuable tool for a wide range of hypothesis-driven research questions in systematics, biogeography, and evolution. SUPERSMART will also accelerate the inference of a "Dated Tree of Life" where all node ages are directly comparable. [Bayesian phylogenetics; data mining; divide-and-conquer methods; GenBank; multilocus multispecies coalescent; next-generation sequencing; palms; primates; tree calibration.]

    Detailed structure-activity relationship of indolecarboxamides as H(4) receptor ligands

    A series of 76 derivatives of the indolecarboxamide 1 were synthesized, which allows a detailed SAR investigation of this well known scaffold. The data enable the definition of a predictive QSAR model which identifies several compounds with an activity comparable to 1. A selection of these new

    Human umbilical vein versus heparin-bonded polyester for femoro-popliteal bypass: 5-year results of a prospective randomized multicentre trial.

    PURPOSE: To compare long-term patency of Heparin-Bonded Dacron (HBD) and Human Umbilical Vein (HUV) vascular prostheses in above-knee femoro-popliteal bypass surgery. DESIGN: A prospective randomized multi-centre clinical trial. PATIENTS AND METHODS: Femoro-popliteal bypasses were performed in 129 patients between 1996 and 2001. After randomization 70 patients received an HUV and 59 an HBD prosthesis. Patients were followed up every three months during the first postoperative year and yearly thereafter. The median follow-up was 60 months (range 3-96 months). Graft occlusions were detected by duplex scanning, angiography or surgical exploration. RESULTS: The cumulative primary patency rates were 79%, 66% and 58% at 1, 3 and 5 years postoperatively. Primary patency rates for HUV were 74%, 64% and 58% at 1, 3 and 5 years and 84%, 68% and 58% for HBD, respectively (log-rank test, p=0.745). Overall secondary patency rates were 82%, 72% and 61% at 1, 3 and 5 years postoperatively. The overall cumulative limb salvage at 5 years follow-up was 89% (CI 80%-91%) and was not dependent on graft type. Smoking (p=0.019), number of patent crural arteries (p=0.030) and previous cerebro-vascular events (p=0.030) were significant predictors of graft occlusion. CONCLUSION: There was no difference in long-term graft performance between HUV and HBD for above-knee infrainguinal bypass.