10 research outputs found
Recommended from our members
European project 'Species 2000 europa' and the species catalogue of the world
Species 2000: an architecture and strategy for creating the Catalogue of the World's Plants: abstract & power point presentation
The Catalogue of Life: Assembling data into a global taxonomic checklist
Producing a global taxonomic checklist of all species is essential for indexing biodiversity data and for providing the basic knowledge needed to study, manage, and conserve biological diversity. The Catalogue of Life (CoL) aims to provide such a checklist, and its 2019 annual edition includes 1.9 million species names. Assembling data into CoL is complex: it requires reformatting data, running quality assurance tests, and collaborating with data providers to resolve detected taxonomic conflicts. Global Species Databases (GSDs) are submitted to CoL in a wide variety of data formats by hundreds of taxonomic experts and institutions. Submitted data are reformatted to a standard data submission format: CoL Standard Dataset (ACEF), DarwinCore, or CoLDP. A series of standardized data integrity checks is then run to detect and resolve frequently occurring data quality problems, including character encoding corruption, non-Latin characters in scientific names, missing parents, duplicated and homonymic names within the GSD and among other GSDs, split taxonomic groups that have been assigned to multiple parent taxa, and other issues. The process and challenges of assembling data into the Catalogue of Life, and the future direction of the project in migrating to the CoL+ infrastructure, will be discussed.
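The integrity checks named in the abstract above can be illustrated with a minimal Python sketch. It is not CoL's actual pipeline: the record layout `(taxon_id, parent_id, scientific_name)`, the sample records, the `run_checks` function, and the mojibake pattern are all assumptions made for illustration. It covers only four of the listed problems (encoding corruption, non-Latin characters, missing parents, and duplicated names within one dataset) and does not attempt homonym detection across GSDs or split-group detection.

```python
import re
import unicodedata
from collections import Counter

# Hypothetical sample data: (taxon_id, parent_id, scientific_name).
RECORDS = [
    ("1", None, "Fabaceae"),
    ("2", "1", "Vicia"),
    ("3", "2", "Vicia faba"),
    ("4", "2", "Vicia faba"),              # same name as record 3
    ("5", "99", "Lathyrus"),               # parent 99 is not in the dataset
    ("6", "2", "Vicia sativ\u0430"),       # last letter is Cyrillic, not Latin
    ("7", "2", "Vicia cr\u00c3\u00a1cca"),  # UTF-8 bytes misread as Latin-1
]

# Assumed heuristic: a typical UTF-8-as-Latin-1 artifact, or U+FFFD.
MOJIBAKE = re.compile("[\u00c2\u00c3][\u0080-\u00bf]|\ufffd")


def has_non_latin_letter(name):
    """True if any alphabetic character is outside the Latin script."""
    return any(
        ch.isalpha() and not unicodedata.name(ch, "").startswith("LATIN")
        for ch in name
    )


def run_checks(records):
    """Return a dict mapping each issue type to the affected taxon ids."""
    ids = {taxon_id for taxon_id, _, _ in records}
    name_counts = Counter(name for _, _, name in records)
    issues = {
        "encoding": [],
        "non_latin": [],
        "missing_parent": [],
        "duplicate_name": [],
    }
    for taxon_id, parent_id, name in records:
        if MOJIBAKE.search(name):
            issues["encoding"].append(taxon_id)
        if has_non_latin_letter(name):
            issues["non_latin"].append(taxon_id)
        if parent_id is not None and parent_id not in ids:
            issues["missing_parent"].append(taxon_id)
        if name_counts[name] > 1:
            issues["duplicate_name"].append(taxon_id)
    return issues


if __name__ == "__main__":
    for issue, taxon_ids in run_checks(RECORDS).items():
        print(issue, taxon_ids)
```

Each check reports the affected record ids rather than modifying the data, which matches the abstract's emphasis on resolving conflicts together with data providers instead of correcting them silently.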
ILDIS, Euro+Med and Species 2000 Europa: contribution to a taxonomic infrastructure for European plant genetic resources
World database of legumes - results from the first twenty years of the ILDIS project
Towards a Quality Assurance and Quality Control Mechanism for Species List Building
Catalogue of Life (COL) brings together the efforts and contributions of taxonomists from around the world, and addresses the needs of researchers, policy-makers, environmental managers and the wider public for a consistent, up-to-date and authoritative listing of all the world’s known species. The names that science gives to species are fundamental tools that allow us to refer to these units of biodiversity. Knowing the name for a species unlocks everything that has been learned about its biology, distribution and relevance to humans. Every day, taxonomists continue to publish new scientific names and refine our understanding of the world’s species. Over the past 10 years, game-changing progress has been made with the Catalogue of Life and its authoritative taxonomic name indexing services. Major steps were made towards content completion: over 2.1 million of the 2.4 million described species are now documented and validated through COL (Bánki et al. 2023). The COL data infrastructure and services were rebuilt in collaboration with the Global Biodiversity Information Facility (GBIF). This resulted in the GBIF-COL ChecklistBank, an open global repository for publishing and sharing taxonomic and nomenclatural datasets (> 48,000 datasets, including over 160 Global Species Databases). The data management and editing process underlying the integration of data into a single checklist (index) could be more transparent, with clearly documented protocols and procedures, and is currently far too time-consuming.
In the ChecklistBank infrastructure, there are thousands of data sources at our disposal, and we need a powerful, semi-automated quality assurance process, based on community expertise, to ensure the best possible product for users of taxonomic checklists. COL aspires to document, in as much detail as possible, the current quality control and quality assurance processes as they relate to activities at the following levels: 1) data providers, 2) pipeline and data conversions, 3) ChecklistBank infrastructure, and 4) checklist output. Ideally, this should happen throughout the consortium of partners that help the taxonomic community with mobilising and editing authoritative taxonomic data, including initiatives like Atlas of Living Australia, Catalogue of Life China, EDIT Platform for Cybertaxonomy, Integrated Taxonomic Information System, Freshwater Animal Diversity Assessment, TaxonWorks / Species Files, World Flora Online / Rhakhis, and World Register of Marine Species / LifeWatch Aphia. In recent years, more attention has been given to the principles of creating an authoritative list of the world's species (Garnett et al. 2020). COL has developed and adopted new criteria for the inclusion of taxonomic data sources in the COL Checklist (Hobern et al. 2021), and ChecklistBank certainly provides more avenues for (automated) quality checks and procedures. But to guarantee the highest confidence in data, we need a transparent and consistent data quality assurance and data quality control mechanism that spans the creation of taxonomic data, data publishing, and delivery into taxonomic data products and the global list of known species. We will present the current status of quality control and quality assurance processes in Catalogue of Life, and discuss what should constitute a quality control and quality assurance mechanism for species list building.
Species 2000 & ITIS Catalogue of Life
A digital resource that aims to become a comprehensive catalogue of all known species of organisms on Earth.