
    GenSeed-HMM: A tool for progressive assembly using profile HMMs as seeds and its application in Alpavirinae viral discovery from metagenomic data

    This work reports the development of GenSeed-HMM, a program that implements seed-driven progressive assembly, an approach to reconstruct specific sequences from unassembled data, starting from short nucleotide or protein seed sequences or profile Hidden Markov Models (HMMs). The program can use any one of a number of sequence assemblers. Assembly is performed in multiple steps and relatively few reads are used in each cycle; consequently, the program demands low computational resources. As a proof-of-concept and to demonstrate the power of HMM-driven progressive assemblies, GenSeed-HMM was applied to metagenomic datasets in the search for diverse ssDNA bacteriophages from the recently described Alpavirinae subfamily. Profile HMMs were built using Alpavirinae-specific regions from multiple sequence alignments using either the viral protein 1 (VP1) (major capsid protein) or VP4 (genome replication initiation protein). These profile HMMs were used by GenSeed-HMM (running the Newbler assembler) as seeds to reconstruct viral genomes from sequencing datasets of human fecal samples. All contigs obtained were annotated and taxonomically classified using similarity searches and phylogenetic analyses. The most specific profile HMM seed enabled the reconstruction of 45 partial or complete Alpavirinae genomic sequences. A comparison with conventional (global) assembly of the same original dataset, using Newbler in a standalone execution, revealed that GenSeed-HMM outperformed global genomic assembly in several of the metrics employed. This approach is capable of detecting organisms that have not been used in the construction of the profile HMM, which opens up the possibility of diagnosing novel viruses without previous specific information, constituting a de novo diagnosis. Additional applications include, but are not limited to, the specific assembly of extrachromosomal elements such as plastid and mitochondrial genomes from metagenomic data. Profile HMM seeds can also be used to reconstruct specific protein-coding genes for gene diversity studies, and to determine all possible gene variants present in a metagenomic sample. Such surveys could be useful to detect the emergence of drug-resistance variants in sensitive environments such as hospitals and animal production facilities, where antibiotics are regularly used. Finally, GenSeed-HMM can be used as an adjunct for gap closure in assembly finishing projects, by using multiple contig ends as anchored seeds.

    One sixth of Amazonian tree diversity is dependent on river floodplains

    Amazonia's floodplain system is the largest and most biodiverse on Earth. Although forests are crucial to the ecological integrity of floodplains, our understanding of their species composition and how this may differ from surrounding forest types is still far too limited, particularly as changing inundation regimes begin to reshape floodplain tree communities and the critical ecosystem functions they underpin. Here we address this gap by taking a spatially explicit look at Amazonia-wide patterns of tree-species turnover and ecological specialization of the region's floodplain forests. We show that the majority of Amazonian tree species can inhabit floodplains, and about a sixth of Amazonian tree diversity is ecologically specialized on floodplains. The degree of specialization in floodplain communities is driven by regional flood patterns, with the most compositionally differentiated floodplain forests located centrally within the fluvial network and contingent on the most extraordinary flood magnitudes regionally. Our results provide a spatially explicit view of ecological specialization of floodplain forest communities and expose the need for whole-basin hydrological integrity to protect the Amazon's tree diversity and its function.

    Author Correction: One sixth of Amazonian tree diversity is dependent on river floodplains


    Brazilian Flora 2020: Leveraging the power of a collaborative scientific network

    The shortage of reliable primary taxonomic data limits the description of biological taxa and the understanding of biodiversity patterns and processes, complicating biogeographical, ecological, and evolutionary studies. This deficit creates a significant taxonomic impediment to biodiversity research and conservation planning. The taxonomic impediment and the biodiversity crisis are widely recognized, highlighting the urgent need for reliable taxonomic data. Over the past decade, numerous countries worldwide have devoted considerable effort to Target 1 of the Global Strategy for Plant Conservation (GSPC), which called for the preparation of a working list of all known plant species by 2010 and an online world Flora by 2020. Brazil is a megadiverse country, home to more of the world's known plant species than any other country. Despite that, Flora Brasiliensis, concluded in 1906, was the last comprehensive treatment of the Brazilian flora. The lack of accurate estimates of the number of species of algae, fungi, and plants occurring in Brazil contributes to the prevailing taxonomic impediment and delays progress towards the GSPC targets. Over the past 12 years, a legion of taxonomists, motivated to meet Target 1 of the GSPC, worked together to gather and integrate knowledge on the algal, plant, and fungal diversity of Brazil. Overall, a team of about 980 taxonomists joined efforts in a highly collaborative project that used cybertaxonomy to prepare an updated Flora of Brazil, showing the power of scientific collaboration to reach ambitious goals. This paper presents an overview of the Brazilian Flora 2020 and provides taxonomic and spatial updates on the algae, fungi, and plants found in one of the world's most biodiverse countries. We further identify collection gaps and summarize future goals that extend beyond 2020. Our results show that Brazil is home to 46,975 native species of algae, fungi, and plants, of which 19,669 are endemic to the country. The data compiled to date suggest that the Atlantic Rainforest might be the most diverse Brazilian domain for all plant groups except gymnosperms, which are most diverse in the Amazon. However, scientific knowledge of Brazilian diversity is still unequally distributed, with the Atlantic Rainforest and the Cerrado being the most intensively sampled and studied biomes in the country. In times of “scientific reductionism”, with botanical and mycological sciences suffering pervasive depreciation in recent decades, the first online Flora of Brazil 2020 significantly enhanced the quality and quantity of taxonomic data available for algae, fungi, and plants from Brazil. This project also made all the information freely available online, providing a firm foundation for future research and for the management, conservation, and sustainable use of the Brazilian funga and flora.

    Measurement of electrons from semileptonic heavy-flavour hadron decays at midrapidity in pp and Pb–Pb collisions at √sNN = 5.02 TeV

    The differential invariant yield as a function of transverse momentum (pT) of electrons from semileptonic heavy-flavour hadron decays was measured at midrapidity in central (0–10%), semi-central (30–50%) and peripheral (60–80%) lead–lead (Pb–Pb) collisions at √sNN = 5.02 TeV in the pT intervals 0.5–26 GeV/c (0–10% and 30–50%) and 0.5–10 GeV/c (60–80%). The production cross section in proton–proton (pp) collisions at √s = 5.02 TeV was also measured, in 0.5 < pT < 10 GeV/c; it lies close to the upper band of perturbative QCD calculation uncertainties up to pT = 5 GeV/c and close to the mean value for larger pT. The modification of the electron yield with respect to what is expected for an incoherent superposition of nucleon–nucleon collisions is evaluated by measuring the nuclear modification factor RAA. The measurement of the RAA in different centrality classes allows in-medium energy loss of charm and beauty quarks to be investigated. The RAA shows a suppression with respect to unity at intermediate pT, which increases while moving towards more central collisions. Moreover, the measured RAA is sensitive to the modification of the parton distribution functions (PDFs) in nuclei, such as nuclear shadowing, which causes a suppression of heavy-quark production at low pT in heavy-ion collisions at the LHC.

    Dielectron and heavy-quark production in inelastic and high-multiplicity proton–proton collisions at √s = 13 TeV

    The measurement of dielectron production is presented as a function of invariant mass and transverse momentum (pT) at midrapidity ($|y_{e}| < 0.8$) in proton–proton (pp) collisions at a centre-of-mass energy of √s = 13 TeV. The contributions from light-hadron decays are calculated from their measured cross sections in pp collisions at √s = 7 TeV or 13 TeV. The remaining continuum stems from correlated semileptonic decays of heavy-flavour hadrons. Fitting the data with templates from two different MC event generators, PYTHIA and POWHEG, the charm and beauty cross sections at midrapidity are extracted for the first time at this collision energy: $\mathrm{d}\sigma_{c\bar{c}}/\mathrm{d}y|_{y=0} = 974 \pm 138\,(\mathrm{stat.}) \pm 140\,(\mathrm{syst.}) \pm 214\,(\mathrm{BR})\ \mu\mathrm{b}$ and $\mathrm{d}\sigma_{b\bar{b}}/\mathrm{d}y|_{y=0} = 79 \pm 14\,(\mathrm{stat.}) \pm 11\,(\mathrm{syst.}) \pm 5\,(\mathrm{BR})\ \mu\mathrm{b}$ using PYTHIA simulations and $\mathrm{d}\sigma_{c\bar{c}}/\mathrm{d}y|_{y=0} = 1417 \pm 184\,(\mathrm{stat.}) \pm 204\,(\mathrm{syst.}) \pm 312\,(\mathrm{BR})\ \mu\mathrm{b}$ and $\mathrm{d}\sigma_{b\bar{b}}/\mathrm{d}y|_{y=0} = 48 \pm 14\,(\mathrm{stat.}) \pm 7\,(\mathrm{syst.}) \pm 3\,(\mathrm{BR})\ \mu\mathrm{b}$ for POWHEG. These values, whose uncertainties are fully correlated between the two generators, are consistent with extrapolations from lower energies. The different results obtained with POWHEG and PYTHIA imply different kinematic correlations of the heavy-quark pairs in these two generators. Furthermore, comparisons of dielectron spectra in inelastic events and in events collected with a trigger on high charged-particle multiplicities are presented in various pT intervals. The differences are consistent with the already measured scaling of light-hadron and open-charm production at high charged-particle multiplicity as a function of pT. Upper limits for the contribution of virtual direct photons are extracted at 90% confidence level and found to be in agreement with pQCD calculations.

    Direct observation of the dead-cone effect in quantum chromodynamics

    At particle collider experiments, elementary particle interactions with large momentum transfer produce quarks and gluons (known as partons) whose evolution is governed by the strong force, as described by the theory of quantum chromodynamics (QCD) [1]. The vacuum is not transparent to the partons and induces gluon radiation and quark pair production in a process that can be described as a parton shower [2]. Studying the pattern of the parton shower is one of the key experimental tools in understanding the properties of QCD. This pattern is expected to depend on the mass of the initiating parton, through a phenomenon known as the dead-cone effect, which predicts a suppression of the gluon spectrum emitted by a heavy quark of mass m and energy E, within a cone of angular size m/E around the emitter [3]. A direct observation of the dead-cone effect in QCD has not been possible until now, due to the challenge of reconstructing the cascading quarks and gluons from the experimentally accessible bound hadronic states. Here we show the first direct observation of the QCD dead-cone by using new iterative declustering techniques [4, 5] to reconstruct the parton shower of charm quarks. This result confirms a fundamental feature of QCD, which is derived more generally from its origin as a gauge quantum field theory. Furthermore, the measurement of a dead-cone angle constitutes the first direct experimental observation of the non-zero mass of the charm quark, which is a fundamental constant in the standard model of particle physics.
    The direct measurement of the QCD dead cone in charm quark fragmentation is reported, using iterative declustering of jets tagged with a fully reconstructed charmed hadron.
    In particle collider experiments, elementary particle interactions with large momentum transfer produce quarks and gluons (known as partons) whose evolution is governed by the strong force, as described by the theory of quantum chromodynamics (QCD). These partons subsequently emit further partons in a process that can be described as a parton shower, which culminates in the formation of detectable hadrons. Studying the pattern of the parton shower is one of the key experimental tools for testing QCD. This pattern is expected to depend on the mass of the initiating parton, through a phenomenon known as the dead-cone effect, which predicts a suppression of the gluon spectrum emitted by a heavy quark of mass $m_{\rm{Q}}$ and energy $E$, within a cone of angular size $m_{\rm{Q}}/E$ around the emitter. Previously, a direct observation of the dead-cone effect in QCD had not been possible, owing to the challenge of reconstructing the cascading quarks and gluons from the experimentally accessible hadrons. We report the direct observation of the QCD dead cone by using new iterative declustering techniques to reconstruct the parton shower of charm quarks. This result confirms a fundamental feature of QCD. Furthermore, the measurement of a dead-cone angle constitutes a direct experimental observation of the non-zero mass of the charm quark, which is a fundamental constant in the standard model of particle physics.