23 research outputs found

    Interpolative multidimensional scaling techniques for the identification of clusters in very large sequence sets

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Modern pyrosequencing techniques make it possible to study complex bacterial populations, such as <it>16S rRNA</it>, directly from environmental or clinical samples without the need for laboratory purification. Alignment of sequences across the resultant large data sets (100,000+ sequences) is of particular interest for the purpose of identifying potential gene clusters and families, but such analysis represents a daunting computational task. The aim of this work is the development of an efficient pipeline for the clustering of large sequence read sets.</p> <p>Methods</p> <p>Pairwise alignment techniques are used here to calculate genetic distances between sequence pairs. These methods are pleasingly parallel and have been shown to more accurately reflect accurate genetic distances in highly variable regions of <it>rRNA </it>genes than do traditional multiple sequence alignment (MSA) approaches. By utilizing Needleman-Wunsch (NW) pairwise alignment in conjunction with novel implementations of interpolative multidimensional scaling (MDS), we have developed an effective method for visualizing massive biosequence data sets and quickly identifying potential gene clusters.</p> <p>Results</p> <p>This study demonstrates the use of interpolative MDS to obtain clustering results that are qualitatively similar to those obtained through full MDS, but with substantial cost savings. In particular, the wall clock time required to cluster a set of 100,000 sequences has been reduced from seven hours to less than one hour through the use of interpolative MDS.</p> <p>Conclusions</p> <p>Although work remains to be done in selecting the optimal training set size for interpolative MDS, substantial computational cost savings will allow us to cluster much larger sequence sets in the future.</p

    Ortholog of the polymerase theta helicase domain modulates DNA replication in Trypanosoma cruzi

    Get PDF
    DNA polymerase theta (Polθ), a member of the DNA polymerase family A, exhibits a polymerase C-terminal domain, a central domain, and an N-terminal helicase domain. Polθ plays important roles in DNA repair via its polymerase domain, regulating genome integrity. In addition, in mammals, Polθ modulates origin firing timing and MCM helicase recruitment to chromatin. In contrast, as a model eukaryote, Trypanosoma cruzi exhibits two individual putative orthologs of Polθ in different genomic loci; one ortholog is homologous to the Polθ C-terminal polymerase domain, and the other is homologous to the Polθ helicase domain, called Polθ-polymerase and Polθ-helicase, respectively. A pull-down assay using the T. cruzi component of the prereplication complex Orc1/Cdc6 as bait captured Polθ-helicase from the nuclear extract. Orc1/Cdc6 and Polθ-helicase directly interacted, and Polθ-helicase presented DNA unwinding and ATPase activities. A T. cruzi strain overexpressing the Polθ-helicase domain exhibited a significantly decreased amount of DNA-bound MCM7 and impaired replication origin firing. Taken together, these data suggest that Polθ-helicase modulates DNA replication by directly interacting with Orc1/Cdc6, which reduces the binding of MCM7 to DNA and thereby impairs the firing of replication origins

    How vegetation reinforces soil on slopes

    No full text
    International audienceOnce the instability process e.g. erosion or landslides has been identified on a slope, the type of vegetation to best reinforce the soil can then be determined. Plants improve slope stability through changes in mechanical and hydrological properties of the root-soil matrix. The architecture of a plants root system will influence strongly these reinforcing properties. We explain how root morphology and biomechanics changes between species. An overview of vegetation effects on slope hydrology is given, along with an update on the use of models to predict the influence of vegetation on mechanical and hydrological properties of soil on slopes. In conclusion, the optimal root system types for improving slope stability are suggeste
    corecore