11 research outputs found

    Visualizing natural history collection data provides insight into collection development and bias

    Get PDF
    Natural history collections contain estimated billions of records representing a large body of knowledge about the diversity and distribution of life on Earth. Assessments of various forms of bias within the aggregated data associated with specimens in these collections have been conducted across temporal, taxonomic, and spatial domains. Considering that these biases are the sum of biases across all contributing collections to aggregate datasets, the assessment of bias at the collection level is warranted. Interactive visualization provides a powerful tool for the assessment of these biases and insight into the historical development of natural history collections, providing context for where sources of bias may originate and developing historical narratives to clarify our understanding of our own knowledge about life on Earth. Here, I present a case study on using Sankey diagrams to illustrate the development of the entomology type collection at the Academy of Natural Sciences of Drexel University in Philadelphia, Pennsylvania with the hope that extensions of these practices among individual natural history collections are modified and adopted

    Current GBIF occurrence data demonstrates both promise and limitations for potential red listing of spiders

    Get PDF
    Conservation assessments of hyperdiverse groups of organisms are often challenging and limited by the availability of occurrence data needed to calculate assessment metrics such as extent of occurrence (EOO). Spiders represent one such diverse group and have historically been assessed using primary literature with retrospective georeferencing. Here we demonstrate the differences in estimations of EOO and hypothetical IUCN Red List classifications for two extensive spider datasets comprising 479 species in total. The EOO were estimated and compared using literature-based assessments, Global Biodiversity Information Facility (GBIF)-based assessments and combined data assessments. We found that although few changes to hypothetical IUCN Red List classifications occurred with the addition of GBIF data, some species (3.3%) which could previously not be classified could now be assessed with the addition of GBIF data. In addition, the hypothetical classification changed for others (1.5%). On the other hand, GBIF data alone did not provide enough data for 88.7% of species. These results demonstrate the potential of GBIF data to serve as an additional source of information for conservation assessments, complementing literature data, but not particularly useful on its own as it stands right now for spiders.Peer reviewe

    A protocol for reproducible functional diversity analyses

    Get PDF
    The widespread use of species traits in basic and applied ecology, conservation and biogeography has led to an exponential increase in functional diversity analyses, with > 10 000 papers published in 2010-2020, and > 1800 papers only in 2021. This interest is reflected in the development of a multitude of theoretical and methodological frameworks for calculating functional diversity, making it challenging to navigate the myriads of options and to report detailed accounts of trait-based analyses. Therefore, the discipline of trait-based ecology would benefit from the existence of a general guideline for standard reporting and good practices for analyses. We devise an eight-step protocol to guide researchers in conducting and reporting functional diversity analyses, with the overarching goal of increasing reproducibility, transparency and comparability across studies. The protocol is based on: 1) identification of a research question; 2) a sampling scheme and a study design; 3-4) assemblage of data matrices; 5) data exploration and preprocessing; 6) functional diversity computation; 7) model fitting, evaluation and interpretation; and 8) data, metadata and code provision. Throughout the protocol, we provide information on how to best select research questions, study designs, trait data, compute functional diversity, interpret results and discuss ways to ensure reproducibility in reporting results. To facilitate the implementation of this template, we further develop an interactive web-based application (stepFD) in the form of a checklist workflow, detailing all the steps of the protocol and allowing the user to produce a final 'reproducibility report' to upload alongside the published paper. A thorough and transparent reporting of functional diversity analyses ensures that ecologists can incorporate others' findings into meta-analyses, the shared data can be integrated into larger databases for consensus analyses, and available code can be reused by other researchers. All these elements are key to pushing forward this vibrant and fast-growing field of research.Peer reviewe

    Globally distributed occurrences utilised in 200 spider species conservation profiles (Arachnida, Araneae)

    Get PDF
    Background Data on 200 species of spiders were collected to assess the global threat status of the group worldwide. To supplement existing digital occurrence records from GBIF, a dataset of new occurrence records was compiled for all species using published literature or online sources, from which geographic coordinates were extracted or interpreted from locality description data. New information A total of 5,104 occurrence records were obtained, of which 2,378 were from literature or online sources other than GBIF. Of these, 2,308 had coordinate data. Reporting years ranged from 1834 to 2017. Most records were from North America and Europe, with Brazil, China, India and Australia also well represented.Peer reviewe

    A global phylogeny of butterflies reveals their evolutionary history, ancestral hosts and biogeographic origins

    Get PDF
    Butterflies are a diverse and charismatic insect group that are thought to have evolved with plants and dispersed throughout the world in response to key geological events. However, these hypotheses have not been extensively tested because a comprehensive phylogenetic framework and datasets for butterfly larval hosts and global distributions are lacking. We sequenced 391 genes from nearly 2,300 butterfly species, sampled from 90 countries and 28 specimen collections, to reconstruct a new phylogenomic tree of butterflies representing 92% of all genera. Our phylogeny has strong support for nearly all nodes and demonstrates that at least 36 butterfly tribes require reclassification. Divergence time analyses imply an origin similar to 100 million years ago for butterflies and indicate that all but one family were present before the K/Pg extinction event. We aggregated larval host datasets and global distribution records and found that butterflies are likely to have first fed on Fabaceae and originated in what is now the Americas. Soon after the Cretaceous Thermal Maximum, butterflies crossed Beringia and diversified in the Palaeotropics. Our results also reveal that most butterfly species are specialists that feed on only one larval host plant family. However, generalist butterflies that consume two or more plant families usually feed on closely related plants

    Occupancy–detection models with museum specimen data: Promise and pitfalls

    No full text
    Abstract Historical museum records provide potentially useful data for identifying drivers of change in species occupancy. However, because museum records are typically obtained via many collection methods, methodological developments are needed to enable robust inferences. Occupancy–detection models, a relatively new and powerful suite of statistical methods, are a potentially promising avenue because they can account for changes in collection effort through space and time. We use simulated datasets to identify how and when patterns in data and/or modelling decisions can bias inference. We focus primarily on the consequences of contrasting methodological approaches for dealing with species' ranges and inferring species' non‐detections in both space and time. We find that not all datasets are suitable for occupancy–detection analysis but, under the right conditions (namely, datasets that are broken into more time periods for occupancy inference and that contain a high fraction of community‐wide collections, or collection events that focus on communities of organisms), models can accurately estimate trends. Finally, we present a case study on eastern North American odonates where we calculate long‐term trends of occupancy using our most robust workflow. These results indicate that occupancy–detection models are a suitable framework for some research cases and expand the suite of available tools for macroecological analysis available to researchers, especially where structured datasets are unavailable

    Georeferencing for Research Use (GRU): An integrated geospatial training paradigm for biocollections researchers and data providers

    Get PDF
    Georeferencing is the process of aligning a text description of a geographic location with a spatial location based on a geographic coordinate system. Training aids are commonly created around the georeferencing process to disseminate community standards and ideas, guide accurate georeferencing, inform users about new tools, and help users evaluate existing geospatial data. The Georeferencing for Research Use (GRU) workshop was implemented as a training aid that focused on the creation and research use of geospatial coordinates, and included both data researchers and data providers, to facilitate communication between the groups. The workshop included 23 participants with a wide background of expertise ranging from students (undergraduate and graduate), professors, researchers and educators, scientific data managers, natural history collections personnel, and spatial analyst specialists. The conversations and survey results from this workshop demonstrate that it is important to provide opportunities for biocollections data providers to interact directly with the researchers using the data they produce and vice versa

    A global phylogeny of butterflies reveals their evolutionary history, ancestral hosts and biogeographic origins

    Get PDF
    International audienceButterflies are a diverse and charismatic insect group that are thought to have evolved with plants and dispersed throughout the world in response to key geological events. However, these hypotheses have not been extensively tested because a comprehensive phylogenetic framework and datasets for butterfly larval hosts and global distributions are lacking. We sequenced 391 genes from nearly 2,300 butterfly species, sampled from 90 countries and 28 specimen collections, to reconstruct a new phylogenomic tree of butterflies representing 92% of all genera. Our phylogeny has strong support for nearly all nodes and demonstrates that at least 36 butterfly tribes require reclassification. Divergence time analyses imply an origin ~100 million years ago for butterflies and indicate that all but one family were present before the K/Pg extinction event. We aggregated larval host datasets and global distribution records and found that butterflies are likely to have first fed on Fabaceae and originated in what is now the Americas. Soon after the Cretaceous Thermal Maximum, butterflies crossed Beringia and diversified in the Palaeotropics. Our results also reveal that most butterfly species are specialists that feed on only one larval host plant family. However, generalist butterflies that consume two or more plant families usually feed on closely related plants
    corecore