69 research outputs found
Temporal changes in size distributions of the Southern Ocean diatom Fragilariopsis kerguelensis through high-throughput microscopy of sediment trap samples
Some aspects of the life cycle of the Southern Ocean diatom Fragilariopsis kerguelensis have been investigated previously, but many of its details have not been surveyed in nature. We investigated material from a two-year sediment trap time series by high-throughput imaging and image analysis, looking for morphometric signals of life cycle stages. Valve length distributions appeared close to unimodal but positively (right-) skewed. Size cohorts resulting from synchronized sexual reproduction events were not clearly distinguishable. Nevertheless, based on changes in valve length distributions, we found three general seasonal phases. These corresponded to periods of proliferation (with higher proportions of smaller cells during late spring/early summer), cessation of growth (relative loss of smaller cells during late summer/early autumn), and overwintering (little change in size distributions, with an increased proportion of large cells). We discuss possible causes of these signals, and their relevance to growth, sexual activity and adaption to environmental conditions, such as grazing pressures and the need for an overwintering strategy.This work was supported by the Deutsche Forschungsgemeinschaft (DFG) in the framework of the priority programme 1158
‘Antarctic Research with comparative investigations in Arctic
ice areas’ under grant nr. BE4316/4-1, KA1655/3-1; and by an
outgoing scholarship, as well as travel expenses, granted by
the Helmholtz Graduate School for Polar and Marine Research
(POLMAR). Part of this work was supported by the Australian
Government’s Australian Antarctic Science Grant Program under
project number 4078, and Macquarie University (A. RigualHernández and L. Armand)
Repositories for Taxonomic Data: Where We Are and What is Missing
AbstractNatural history collections are leading successful large-scale projects of specimen digitization (images, metadata, DNA barcodes), thereby transforming taxonomy into a big data science. Yet, little effort has been directed towards safeguarding and subsequently mobilizing the considerable amount of original data generated during the process of naming 15,000–20,000 species every year. From the perspective of alpha-taxonomists, we provide a review of the properties and diversity of taxonomic data, assess their volume and use, and establish criteria for optimizing data repositories. We surveyed 4113 alpha-taxonomic studies in representative journals for 2002, 2010, and 2018, and found an increasing yet comparatively limited use of molecular data in species diagnosis and description. In 2018, of the 2661 papers published in specialized taxonomic journals, molecular data were widely used in mycology (94%), regularly in vertebrates (53%), but rarely in botany (15%) and entomology (10%). Images play an important role in taxonomic research on all taxa, with photographs used in &gt;80% and drawings in 58% of the surveyed papers. The use of omics (high-throughput) approaches or 3D documentation is still rare. Improved archiving strategies for metabarcoding consensus reads, genome and transcriptome assemblies, and chemical and metabolomic data could help to mobilize the wealth of high-throughput data for alpha-taxonomy. Because long-term—ideally perpetual—data storage is of particular importance for taxonomy, energy footprint reduction via less storage-demanding formats is a priority if their information content suffices for the purpose of taxonomic studies. Whereas taxonomic assignments are quasifacts for most biological disciplines, they remain hypotheses pertaining to evolutionary relatedness of individuals for alpha-taxonomy. For this reason, an improved reuse of taxonomic data, including machine-learning-based species identification and delimitation pipelines, requires a cyberspecimen approach—linking data via unique specimen identifiers, and thereby making them findable, accessible, interoperable, and reusable for taxonomic research. This poses both qualitative challenges to adapt the existing infrastructure of data centers to a specimen-centered concept and quantitative challenges to host and connect an estimated 2 million images produced per year by alpha-taxonomic studies, plus many millions of images from digitization campaigns. Of the 30,000–40,000 taxonomists globally, many are thought to be nonprofessionals, and capturing the data for online storage and reuse therefore requires low-complexity submission workflows and cost-free repository use. Expert taxonomists are the main stakeholders able to identify and formalize the needs of the discipline; their expertise is needed to implement the envisioned virtual collections of cyberspecimens. [Big data; cyberspecimen; new species; omics; repositories; specimen identifier; taxonomy; taxonomic data.]</jats:p
Quantitative comparison of taxa and taxon concepts in the diatom genus <i>Fragilariopsis</i>: a case study on using slide scanning, multiexpert image annotation, and image analysis in taxonomy
Semiautomated methods for microscopic image acquisition, image analysis, and taxonomic identification have repeatedly received attention in diatom analysis. Less well studied is the question whether and how such methods might prove useful for clarifying the delimitation of species that are difficult to separate for human taxonomists. To try to answer this question, three very similar Fragilariopsis species endemic to the Southern Ocean were targeted in this study: F. obliquecostata, F. ritscheri, and F. sublinearis. A set of 501 extended focus depth specimen images were obtained using a standardized, semiautomated microscopic procedure. Twelve diatomists independently identified these specimen images in order to reconcile taxonomic opinions and agree upon a taxonomic gold standard. Using image analyses, we then extracted morphometric features representing taxonomic characters of the target taxa. The discriminating ability of individual morphometric features was tested visually and statistically, and multivariate classification experiments were performed to test the agreement of the quantitatively defined taxa assignments with expert consensus opinion. Beyond an updated differential diagnosis of the studied taxa, our study also shows that automated imaging and image analysis procedures for diatoms are coming close to reaching a broad applicability for routine use.Facultad de Ciencias Naturales y Muse
Quantitative comparison of taxa and taxon concepts in the diatom genus Fragilariopsis: a case study on using slide scanning, multi‐expert image annotation and image analysis in taxonomy
Semi‐automated methods for microscopic image acquisition, image analysis and taxonomic identification have repeatedly received attention in diatom analysis. Less well studied is the question whether and how such methods might prove useful for clarifying the delimitation of species that are difficult to separate for human taxonomists. To try to answer this question, three very similar Fragilariopsis species endemic to the Southern Ocean were targeted in this study: F. obliquecostata, F. ritscheri, and F. sublinearis. A set of 501 extended focus depth specimen images were obtained using a standardized, semi‐automated microscopic procedure. Twelve diatomists independently identified these specimen images in order to reconcile taxonomic opinions and agree upon a taxonomic gold standard. Using image analyses, we then extracted morphometric features representing taxonomic characters of the target taxa. The discriminating ability of individual morphometric features was tested visually and statistically, and multivariate classification experiments were performed to test the agreement of the quantitatively‐defined taxa assignments with expert consensus opinion. Beyond an updated differential diagnosis of the studied taxa, our study also shows that automated imaging and image analysis procedures for diatoms are coming close to reaching a broad applicability for routine use
Accurate Genome Relative Abundance Estimation Based on Shotgun Metagenomic Reads
Accurate estimation of microbial community composition based on metagenomic sequencing data is fundamental for subsequent metagenomics analysis. Prevalent estimation methods are mainly based on directly summarizing alignment results or its variants; often result in biased and/or unstable estimates. We have developed a unified probabilistic framework (named GRAMMy) by explicitly modeling read assignment ambiguities, genome size biases and read distributions along the genomes. Maximum likelihood method is employed to compute Genome Relative Abundance of microbial communities using the Mixture Model theory (GRAMMy). GRAMMy has been demonstrated to give estimates that are accurate and robust across both simulated and real read benchmark datasets. We applied GRAMMy to a collection of 34 metagenomic read sets from four metagenomics projects and identified 99 frequent species (minimally 0.5% abundant in at least 50% of the data- sets) in the human gut samples. Our results show substantial improvements over previous studies, such as adjusting the over-estimated abundance for Bacteroides species for human gut samples, by providing a new reference-based strategy for metagenomic sample comparisons. GRAMMy can be used flexibly with many read assignment tools (mapping, alignment or composition-based) even with low-sensitivity mapping results from huge short-read datasets. It will be increasingly useful as an accurate and robust tool for abundance estimation with the growing size of read sets and the expanding database of reference genomes
The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): illuminating the functional diversity of eukaryotic life in the oceans through transcriptome sequencing.
Microbial ecology is plagued by problems
of an abstract nature. Cell sizes are so
small and population sizes so large that
both are virtually incomprehensible. Niches
are so far from our everyday experience
as to make their very definition elusive.
Organisms that may be abundant and
critical to our survival are little understood,
seldom described and/or cultured,
and sometimes yet to be even seen. One
way to confront these problems is to use
data of an even more abstract nature:
molecular sequence data. Massive environmental
nucleic acid sequencing, such
as metagenomics or metatranscriptomics,
promises functional analysis of microbial
communities as a whole, without prior
knowledge of which organisms are in the
environment or exactly how they are
interacting. But sequence-based ecological
studies nearly always use a comparative
approach, and that requires relevant
reference sequences, which are an extremely
limited resource when it comes to
microbial eukaryotes.
In practice, this means sequence databases
need to be populated with enormous
quantities of data for which we have
some certainties about the source. Most
important is the taxonomic identity of
the organism from which a sequence is
derived and as much functional identification
of the encoded proteins as possible. In
an ideal world, such information would be
available as a large set of complete, well curated,
and annotated genomes for all the
major organisms from the environment
in question. Reality substantially diverges
from this ideal, but at least for bacterial
molecular ecology, there is a database
consisting of thousands of complete genomes
from a wide range of taxa,
supplemented by a phylogeny-driven approach
to diversifying genomics [2]. For
eukaryotes, the number of available genomes
is far, far fewer, and we have relied
much more heavily on random growth of
sequence databases, raising the
question as to whether this is fit for
purpose
The <i>Ectocarpus</i> genome and the independent evolution of multicellularity in brown algae
Brown algae (Phaeophyceae) are complex photosynthetic organisms with a very different evolutionary history to green plants, to which they are only distantly related1. These seaweeds are the dominant species in rocky coastal ecosystems and they exhibit many interesting adaptations to these, often harsh, environments. Brown algae are also one of only a small number of eukaryotic lineages that have evolved complex multicellularity (Fig. 1).We report the 214 million base pair (Mbp) genome sequence of the filamentous seaweed Ectocarpus siliculosus (Dillwyn) Lyngbye, a model organism for brown algae, closely related to the kelps (Fig. 1). Genome features such as the presence of an extended set of light-harvesting and pigment biosynthesis genes and new metabolic processes such as halide metabolism help explain the ability of this organism to cope with the highly variable tidal environment. The evolution of multicellularity in this lineage is correlated with the presence of a rich array of signal transduction genes. Of particular interest is the presence of a family of receptor kinases, as the independent evolution of related molecules has been linked with the emergence of multicellularity in both the animal and green plant lineages. The Ectocarpus genome sequence represents an important step towards developing this organism as a model species, providing the possibility to combine genomic and genetic2 approaches to explore these and other aspects of brown algal biology further
Phylogeography of Ostreopsis along West Pacific Coast, with Special Reference to a Novel Clade from Japan
BACKGROUND: A dinoflagellate genus Ostreopsis is known as a potential producer of Palytoxin derivatives. Palytoxin is the most potent non-proteinaceous compound reported so far. There has been a growing number of reports on palytoxin-like poisonings in southern areas of Japan; however, the distribution of Ostreopsis has not been investigated so far. Morphological plasticity of Ostreopsis makes reliable microscopic identification difficult so the employment of molecular tools was desirable. METHODS/PRINCIPAL FINDING: In total 223 clones were examined from samples mainly collected from southern areas of Japan. The D8-D10 region of the nuclear large subunit rDNA (D8-D10) was selected as a genetic marker and phylogenetic analyses were conducted. Although most of the clones were unable to be identified, there potentially 8 putative species established during this study. Among them, Ostreopsis sp. 1-5 did not belong to any known clade, and each of them formed its own clade. The dominant species was Ostreopsis sp. 1, which accounted for more than half of the clones and which was highly toxic and only distributed along the Japanese coast. Comparisons between the D8-D10 and the Internal Transcribed Spacer (ITS) region of the nuclear rDNA, which has widely been used for phylogenetic/phylogeographic studies in Ostreopsis, revealed that the D8-D10 was less variable than the ITS, making consistent and reliable phylogenetic reconstruction possible. CONCLUSIONS/SIGNIFICANCE: This study unveiled a surprisingly diverse and widespread distribution of Japanese Ostreopsis. Further study will be required to better understand the phylogeography of the genus. Our results posed the urgent need for the development of the early detection/warning systems for Ostreopsis, particularly for the widely distributed and strongly toxic Ostreopsis sp. 1. The D8-D10 marker will be suitable for these purposes
The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): illuminating the functional diversity of eukaryotic life in the oceans through transcriptome sequencing
International audienceCurrent sampling of genomic sequence data from eukaryotes is relatively poor, biased, and inadequate to address important questions about their biology, evolution, and ecology; this Community Page describes a resource of 700 transcriptomes from marine microbial eukaryotes to help understand their role in the world's oceans
- …