129 research outputs found

    Managing Workflows on top of a Cloud Computing Orchestrator for using heterogeneous environments on e-Science

    Full text link
    [EN] Scientific workflows (SWFs) are widely used to model processes in e-Science. SWFs are executed by means of workflow management systems (WMSs), which orchestrate the workload on top of computing infrastructures. The advent of cloud computing infrastructures has opened the door of using on-demand infrastructures to complement or even replace local infrastructures. However, new issues have arisen, such as the integration of hybrid resources or the compromise between infrastructure reutilisation and elasticity. In this article, we present an ad hoc solution for managing workflows exploiting the capabilities of cloud orchestrators to deploy resources on demand according to the workload and to combine heterogeneous cloud providers (such as on-premise clouds and public clouds) and traditional infrastructures (clusters) to minimise costs and response time. The work does not propose yet another WMS but demonstrates the benefits of the integration of cloud orchestration when running complex workflows. The article shows several configuration experiments from a realistic comparative genomics workflow called Orthosearch, to migrate memory-intensive workload to public infrastructures while keeping other blocks of the experiment running locally. The article computes running time and cost suggesting best practices.This paper wants to acknowledge the support of the EUBrazilCC project, funded by the European Commission (STREP 614048) and the Brazilian MCT/CNPq N. 13/2012, for the use of its infrastructure. The authors would like also to thank the Spanish 'Ministerio de Economia y Competitividad' for the project 'Clusters Virtuales Elasticos y Migrables sobre Infraestructuras Cloud Hibridas' with reference TIN2013-44390-R.Carrión Collado, AA.; Caballer Fernández, M.; Blanquer Espert, I.; Kotowski, N.; Jardim, R.; Dávila, AMR. (2017). Managing Workflows on top of a Cloud Computing Orchestrator for using heterogeneous environments on e-Science. International Journal of Web and Grid Services. 13(4):375-402. doi:10.1504/IJWGS.2017.10003225S37540213

    Improving model construction of profile HMMs for remote homology detection through structural alignment

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Remote homology detection is a challenging problem in Bioinformatics. Arguably, profile Hidden Markov Models (pHMMs) are one of the most successful approaches in addressing this important problem. pHMM packages present a relatively small computational cost, and perform particularly well at recognizing remote homologies. This raises the question of whether structural alignments could impact the performance of pHMMs trained from proteins in the <it>Twilight Zone</it>, as structural alignments are often more accurate than sequence alignments at identifying motifs and functional residues. Next, we assess the impact of using structural alignments in pHMM performance.</p> <p>Results</p> <p>We used the SCOP database to perform our experiments. Structural alignments were obtained using the 3DCOFFEE and MAMMOTH-mult tools; sequence alignments were obtained using CLUSTALW, TCOFFEE, MAFFT and PROBCONS. We performed leave-one-family-out cross-validation over super-families. Performance was evaluated through ROC curves and paired two tailed t-test.</p> <p>Conclusion</p> <p>We observed that pHMMs derived from structural alignments performed significantly better than pHMMs derived from sequence alignment in low-identity regions, mainly below 20%. We believe this is because structural alignment tools are better at focusing on the important patterns that are more often conserved through evolution, resulting in higher quality pHMMs. On the other hand, sensitivity of these tools is still quite low for these low-identity regions. Our results suggest a number of possible directions for improvements in this area.</p

    Microbial Diversity of a Brazilian Coastal Region Influenced by an Upwelling System and Anthropogenic Activity

    Get PDF
    BACKGROUND: Upwelling systems are characterised by an intense primary biomass production in the surface (warmest) water after the outcrop of the bottom (coldest) water, which is rich in nutrients. Although it is known that the microbial assemblage plays an important role in the food chain of marine systems and that the upwelling systems that occur in southwest Brazil drive the complex dynamics of the food chain, little is known about the microbial composition present in this region. METHODOLOGY/PRINCIPAL FINDINGS: We carried out a molecular survey based on SSU rRNA gene from the three domains of the phylogenetic tree of life present in a tropical upwelling region (Arraial do Cabo, Rio de Janeiro, Brazil). The aim was to analyse the horizontal and vertical variations of the microbial composition in two geographically close areas influenced by anthropogenic activity (sewage disposal/port activity) and upwelling phenomena, respectively. A lower estimated diversity of microorganisms of the three domains of the phylogenetic tree of life was found in the water of the area influenced by anthropogenic activity compared to the area influenced by upwelling phenomena. We observed a heterogenic distribution of the relative abundance of taxonomic groups, especially in the Archaea and Eukarya domains. The bacterial community was dominated by Proteobacteria, Cyanobacteria and Bacteroidetes phyla, whereas the microeukaryotic community was dominated by Metazoa, Fungi, Alveolata and Stramenopile. The estimated archaeal diversity was the lowest of the three domains and was dominated by uncharacterised marine Crenarchaeota that were most closely related to Marine Group I. CONCLUSIONS/SIGNIFICANCE: The variety of conditions and the presence of different microbial assemblages indicated that the area of Arraial do Cabo can be used as a model for detailed studies that contemplate the correlation between pollution-indicating parameters and the depletion of microbial diversity in areas close to anthropogenic activity; functional roles and geochemical processes; phylogeny of the uncharacterised diversity; and seasonal variations of the microbial assemblages

    Estimating the global conservation status of more than 15,000 Amazonian tree species

    Get PDF
    Estimates of extinction risk for Amazonian plant and animal species are rare and not often incorporated into land-use policy and conservation planning. We overlay spatial distribution models with historical and projected deforestation to show that at least 36% and up to 57% of all Amazonian tree species are likely to qualify as globally threatened under International Union for Conservation of Nature (IUCN) Red List criteria. If confirmed, these results would increase the number of threatened plant species on Earth by 22%. We show that the trends observed in Amazonia apply to trees throughout the tropics, and we predict thatmost of the world’s >40,000 tropical tree species now qualify as globally threatened. A gap analysis suggests that existing Amazonian protected areas and indigenous territories will protect viable populations of most threatened species if these areas suffer no further degradation, highlighting the key roles that protected areas, indigenous peoples, and improved governance can play in preventing large-scale extinctions in the tropics in this century

    Geographic patterns of tree dispersal modes in Amazonia and their ecological correlates

    Get PDF
    Aim: To investigate the geographic patterns and ecological correlates in the geographic distribution of the most common tree dispersal modes in Amazonia (endozoochory, synzoochory, anemochory and hydrochory). We examined if the proportional abundance of these dispersal modes could be explained by the availability of dispersal agents (disperser-availability hypothesis) and/or the availability of resources for constructing zoochorous fruits (resource-availability hypothesis). Time period: Tree-inventory plots established between 1934 and 2019. Major taxa studied: Trees with a diameter at breast height (DBH) ≥ 9.55 cm. Location: Amazonia, here defined as the lowland rain forests of the Amazon River basin and the Guiana Shield. Methods: We assigned dispersal modes to a total of 5433 species and morphospecies within 1877 tree-inventory plots across terra-firme, seasonally flooded, and permanently flooded forests. We investigated geographic patterns in the proportional abundance of dispersal modes. We performed an abundance-weighted mean pairwise distance (MPD) test and fit generalized linear models (GLMs) to explain the geographic distribution of dispersal modes. Results: Anemochory was significantly, positively associated with mean annual wind speed, and hydrochory was significantly higher in flooded forests. Dispersal modes did not consistently show significant associations with the availability of resources for constructing zoochorous fruits. A lower dissimilarity in dispersal modes, resulting from a higher dominance of endozoochory, occurred in terra-firme forests (excluding podzols) compared to flooded forests. Main conclusions: The disperser-availability hypothesis was well supported for abiotic dispersal modes (anemochory and hydrochory). The availability of resources for constructing zoochorous fruits seems an unlikely explanation for the distribution of dispersal modes in Amazonia. The association between frugivores and the proportional abundance of zoochory requires further research, as tree recruitment not only depends on dispersal vectors but also on conditions that favour or limit seedling recruitment across forest types

    Geography and ecology shape the phylogenetic composition of Amazonian tree communities

    Get PDF
    Aim: Amazonia hosts more tree species from numerous evolutionary lineages, both young and ancient, than any other biogeographic region. Previous studies have shown that tree lineages colonized multiple edaphic environments and dispersed widely across Amazonia, leading to a hypothesis, which we test, that lineages should not be strongly associated with either geographic regions or edaphic forest types. Location: Amazonia. Taxon: Angiosperms (Magnoliids; Monocots; Eudicots). Methods: Data for the abundance of 5082 tree species in 1989 plots were combined with a mega-phylogeny. We applied evolutionary ordination to assess how phylogenetic composition varies across Amazonia. We used variation partitioning and Moran\u27s eigenvector maps (MEM) to test and quantify the separate and joint contributions of spatial and environmental variables to explain the phylogenetic composition of plots. We tested the indicator value of lineages for geographic regions and edaphic forest types and mapped associations onto the phylogeny. Results: In the terra firme and várzea forest types, the phylogenetic composition varies by geographic region, but the igapó and white-sand forest types retain a unique evolutionary signature regardless of region. Overall, we find that soil chemistry, climate and topography explain 24% of the variation in phylogenetic composition, with 79% of that variation being spatially structured (R2^{2} = 19% overall for combined spatial/environmental effects). The phylogenetic composition also shows substantial spatial patterns not related to the environmental variables we quantified (R2^{2} = 28%). A greater number of lineages were significant indicators of geographic regions than forest types. Main Conclusion: Numerous tree lineages, including some ancient ones (>66 Ma), show strong associations with geographic regions and edaphic forest types of Amazonia. This shows that specialization in specific edaphic environments has played a long-standing role in the evolutionary assembly of Amazonian forests. Furthermore, many lineages, even those that have dispersed across Amazonia, dominate within a specific region, likely because of phylogenetically conserved niches for environmental conditions that are prevalent within regions

    Consistent patterns of common species across tropical tree communities

    Get PDF
    Trees structure the Earth’s most biodiverse ecosystem, tropical forests. The vast number of tree species presents a formidable challenge to understanding these forests, including their response to environmental change, as very little is known about most tropical tree species. A focus on the common species may circumvent this challenge. Here we investigate abundance patterns of common tree species using inventory data on 1,003,805 trees with trunk diameters of at least 10 cm across 1,568 locations1,2,3,4,5,6 in closed-canopy, structurally intact old-growth tropical forests in Africa, Amazonia and Southeast Asia. We estimate that 2.2%, 2.2% and 2.3% of species comprise 50% of the tropical trees in these regions, respectively. Extrapolating across all closed-canopy tropical forests, we estimate that just 1,053 species comprise half of Earth’s 800 billion tropical trees with trunk diameters of at least 10 cm. Despite differing biogeographic, climatic and anthropogenic histories7, we find notably consistent patterns of common species and species abundance distributions across the continents. This suggests that fundamental mechanisms of tree community assembly may apply to all tropical forests. Resampling analyses show that the most common species are likely to belong to a manageable list of known species, enabling targeted efforts to understand their ecology. Although they do not detract from the importance of rare species, our results open new opportunities to understand the world’s most diverse forests, including modelling their response to environmental change, by focusing on the common species that constitute the majority of their trees.Publisher PDFPeer reviewe
    corecore