256 research outputs found

    Formation of regulatory modules by local sequence duplication

    Get PDF
    Turnover of regulatory sequence and function is an important part of molecular evolution. But what are the modes of sequence evolution leading to rapid formation and loss of regulatory sites? Here, we show that a large fraction of neighboring transcription factor binding sites in the fly genome have formed from a common sequence origin by local duplications. This mode of evolution is found to produce regulatory information: duplications can seed new sites in the neighborhood of existing sites. Duplicate seeds evolve subsequently by point mutations, often towards binding a different factor than their ancestral neighbor sites. These results are based on a statistical analysis of 346 cis-regulatory modules in the Drosophila melanogaster genome, and a comparison set of intergenic regulatory sequence in Saccharomyces cerevisiae. In fly regulatory modules, pairs of binding sites show significantly enhanced sequence similarity up to distances of about 50 bp. We analyze these data in terms of an evolutionary model with two distinct modes of site formation: (i) evolution from independent sequence origin and (ii) divergent evolution following duplication of a common ancestor sequence. Our results suggest that pervasive formation of binding sites by local sequence duplications distinguishes the complex regulatory architecture of higher eukaryotes from the simpler architecture of unicellular organisms

    Influence of Landscape Structure and Human Modifications on Insect Biomass and Bat Foraging Activity in an Urban Landscape

    Get PDF
    Urban landscapes are often located in biologically diverse, productive regions. As such, urbanization may have dramatic consequences for this diversity, largely due to changes in the structure and function of urban communities. We examined the influence of landscape productivity (indexed by geology), housing density and vegetation clearing on the spatial distribution of nocturnal insect biomass and the foraging activity of insectivorous bats in the urban landscape of Sydney, Australia. Nocturnal insect biomass (g) and bat foraging activity were sampled from 113 sites representing backyard, open space, bushland and riparian landscape elements, across urban, suburban and vegetated landscapes within 60 km of Sydney's Central Business District. We found that insect biomass was at least an order of magnitude greater within suburban landscapes in bushland and backyard elements located on the most fertile shale influenced geologies (both p<0.001) compared to nutrient poor sandstone landscapes. Similarly, the feeding activity of bats was greatest in bushland, and riparian elements within suburbs on fertile geologies (p = 0.039). Regression tree analysis indicated that the same three variables explained the major proportion of the variation in insect biomass and bat foraging activity. These were ambient temperature (positive), housing density (negative) and the percent of fertile shale geologies (positive) in the landscape; however variation in insect biomass did not directly explain bat foraging activity. We suggest that prey may be unavailable to bats in highly urbanized areas if these areas are avoided by many species, suggesting that reduced feeding activity may reflect under-use of urban habitats by bats. Restoration activities to improve ecological function and maintain the activity of a diversity of bat species should focus on maintaining and restoring bushland and riparian habitat, particularly in areas with fertile geology as these were key bat foraging habitats

    Genome of the Avirulent Human-Infective Trypanosome—Trypanosoma rangeli

    Get PDF
    Background: Trypanosoma rangeli is a hemoflagellate protozoan parasite infecting humans and other wild and domestic mammals across Central and South America. It does not cause human disease, but it can be mistaken for the etiologic agent of Chagas disease, Trypanosoma cruzi. We have sequenced the T. rangeli genome to provide new tools for elucidating the distinct and intriguing biology of this species and the key pathways related to interaction with its arthropod and mammalian hosts.  Methodology/Principal Findings: The T. rangeli haploid genome is ,24 Mb in length, and is the smallest and least repetitive trypanosomatid genome sequenced thus far. This parasite genome has shorter subtelomeric sequences compared to those of T. cruzi and T. brucei; displays intraspecific karyotype variability and lacks minichromosomes. Of the predicted 7,613 protein coding sequences, functional annotations could be determined for 2,415, while 5,043 are hypothetical proteins, some with evidence of protein expression. 7,101 genes (93%) are shared with other trypanosomatids that infect humans. An ortholog of the dcl2 gene involved in the T. brucei RNAi pathway was found in T. rangeli, but the RNAi machinery is non-functional since the other genes in this pathway are pseudogenized. T. rangeli is highly susceptible to oxidative stress, a phenotype that may be explained by a smaller number of anti-oxidant defense enzymes and heatshock proteins.  Conclusions/Significance: Phylogenetic comparison of nuclear and mitochondrial genes indicates that T. rangeli and T. cruzi are equidistant from T. brucei. In addition to revealing new aspects of trypanosome co-evolution within the vertebrate and invertebrate hosts, comparative genomic analysis with pathogenic trypanosomatids provides valuable new information that can be further explored with the aim of developing better diagnostic tools and/or therapeutic targets

    IMSA: Integrated metagenomic sequence analysis for identification of exogenous reads in a host genomic background

    Get PDF
    Metagenomics, the study of microbial genomes within diverse environments, is a rapidly developing field. The identification of microbial sequences within a host organism enables the study of human intestinal, respiratory, and skin microbiota, and has allowed the identification of novel viruses in diseases such as Merkel cell carcinoma. There are few publicly available tools for metagenomic high throughput sequence analysis. We present Integrated Metagenomic Sequence Analysis (IMSA), a flexible, fast, and robust computational analysis pipeline that is available for public use. IMSA takes input sequence from high throughput datasets and uses a user-defined host database to filter out host sequence. IMSA then aligns the filtered reads to a user-defined universal database to characterize exogenous reads within the host background. IMSA assigns a score to each node of the taxonomy based on read frequency, and can output this as a taxonomy report suitable for cluster analysis or as a taxonomy map (TaxMap). IMSA also outputs the specific sequence reads assigned to a taxon of interest for downstream analysis. We demonstrate the use of IMSA to detect pathogens and normal flora within sequence data from a primary human cervical cancer carrying HPV16, a primary human cutaneous squamous cell carcinoma carrying HPV 16, the CaSki cell line carrying HPV16, and the HeLa cell line carrying HPV18

    Considering Transposable Element Diversification in De Novo Annotation Approaches

    Get PDF
    Transposable elements (TEs) are mobile, repetitive DNA sequences that are almost ubiquitous in prokaryotic and eukaryotic genomes. They have a large impact on genome structure, function and evolution. With the recent development of high-throughput sequencing methods, many genome sequences have become available, making possible comparative studies of TE dynamics at an unprecedented scale. Several methods have been proposed for the de novo identification of TEs in sequenced genomes. Most begin with the detection of genomic repeats, but the subsequent steps for defining TE families differ. High-quality TE annotations are available for the Drosophila melanogaster and Arabidopsis thaliana genome sequences, providing a solid basis for the benchmarking of such methods. We compared the performance of specific algorithms for the clustering of interspersed repeats and found that only a particular combination of algorithms detected TE families with good recovery of the reference sequences. We then applied a new procedure for reconciling the different clustering results and classifying TE sequences. The whole approach was implemented in a pipeline using the REPET package. Finally, we show that our combined approach highlights the dynamics of well defined TE families by making it possible to identify structural variations among their copies. This approach makes it possible to annotate TE families and to study their diversification in a single analysis, improving our understanding of TE dynamics at the whole-genome scale and for diverse species

    FOXP3 Expression Is Upregulated in CD4+T Cells in Progressive HIV-1 Infection and Is a Marker of Disease Severity

    Get PDF
    Understanding the role of different classes of T cells during HIV infection is critical to determining which responses correlate with protective immunity. To date, it is unclear whether alterations in regulatory T cell (Treg) function are contributory to progression of HIV infection.FOXP3 expression was measured by both qRT-PCR and by flow cytometry in HIV-infected individuals and uninfected controls together with expression of CD25, GITR and CTLA-4. Cultured peripheral blood mononuclear cells were stimulated with anti-CD3 and cell proliferation was assessed by CFSE dilution.HIV infected individuals had significantly higher frequencies of CD4(+)FOXP3(+) T cells (median of 8.11%; range 1.33%-26.27%) than healthy controls (median 3.72%; range 1.3-7.5%; P = 0.002), despite having lower absolute counts of CD4(+)FOXP3(+) T cells. There was a significant positive correlation between the frequency of CD4(+)FOXP3(+) T cells and viral load (rho = 0.593 P = 0.003) and a significant negative correlation with CD4 count (rho = -0.423 P = 0.044). 48% of our patients had CD4 counts below 200 cells/microl and these patients showed a marked elevation of FOXP3 percentage (median 10% range 4.07%-26.27%). Assessing the mechanism of increased FOXP3 frequency, we found that the high FOXP3 levels noted in HIV infected individuals dropped rapidly in unstimulated culture conditions but could be restimulated by T cell receptor stimulation. This suggests that the high FOXP3 expression in HIV infected patients is likely due to FOXP3 upregulation by individual CD4(+) T cells following antigenic or other stimulation.FOXP3 expression in the CD4(+) T cell population is a marker of severity of HIV infection and a potential prognostic marker of disease progression

    Spinning Gland Transcriptomics from Two Main Clades of Spiders (Order: Araneae) - Insights on Their Molecular, Anatomical and Behavioral Evolution

    Get PDF
    Characterized by distinctive evolutionary adaptations, spiders provide a comprehensive system for evolutionary and developmental studies of anatomical organs, including silk and venom production. Here we performed cDNA sequencing using massively parallel sequencers (454 GS-FLX Titanium) to generate ∼80,000 reads from the spinning gland of Actinopus spp. (infraorder: Mygalomorphae) and Gasteracantha cancriformis (infraorder: Araneomorphae, Orbiculariae clade). Actinopus spp. retains primitive characteristics on web usage and presents a single undifferentiated spinning gland while the orbiculariae spiders have seven differentiated spinning glands and complex patterns of web usage. MIRA, Celera Assembler and CAP3 software were used to cluster NGS reads for each spider. CAP3 unigenes passed through a pipeline for automatic annotation, classification by biological function, and comparative transcriptomics. Genes related to spider silks were manually curated and analyzed. Although a single spidroin gene family was found in Actinopus spp., a vast repertoire of specialized spider silk proteins was encountered in orbiculariae. Astacin-like metalloproteases (meprin subfamily) were shown to be some of the most sampled unigenes and duplicated gene families in G. cancriformis since its evolutionary split from mygalomorphs. Our results confirm that the evolution of the molecular repertoire of silk proteins was accompanied by the (i) anatomical differentiation of spinning glands and (ii) behavioral complexification in the web usage. Finally, a phylogenetic tree was constructed to cluster most of the known spidroins in gene clades. This is the first large-scale, multi-organism transcriptome for spider spinning glands and a first step into a broad understanding of spider web systems biology and evolution

    A Wide Extent of Inter-Strain Diversity in Virulent and Vaccine Strains of Alphaherpesviruses

    Get PDF
    Alphaherpesviruses are widespread in the human population, and include herpes simplex virus 1 (HSV-1) and 2, and varicella zoster virus (VZV). These viral pathogens cause epithelial lesions, and then infect the nervous system to cause lifelong latency, reactivation, and spread. A related veterinary herpesvirus, pseudorabies (PRV), causes similar disease in livestock that result in significant economic losses. Vaccines developed for VZV and PRV serve as useful models for the development of an HSV-1 vaccine. We present full genome sequence comparisons of the PRV vaccine strain Bartha, and two virulent PRV isolates, Kaplan and Becker. These genome sequences were determined by high-throughput sequencing and assembly, and present new insights into the attenuation of a mammalian alphaherpesvirus vaccine strain. We find many previously unknown coding differences between PRV Bartha and the virulent strains, including changes to the fusion proteins gH and gB, and over forty other viral proteins. Inter-strain variation in PRV protein sequences is much closer to levels previously observed for HSV-1 than for the highly stable VZV proteome. Almost 20% of the PRV genome contains tandem short sequence repeats (SSRs), a class of nucleic acids motifs whose length-variation has been associated with changes in DNA binding site efficiency, transcriptional regulation, and protein interactions. We find SSRs throughout the herpesvirus family, and provide the first global characterization of SSRs in viruses, both within and between strains. We find SSR length variation between different isolates of PRV and HSV-1, which may provide a new mechanism for phenotypic variation between strains. Finally, we detected a small number of polymorphic bases within each plaque-purified PRV strain, and we characterize the effect of passage and plaque-purification on these polymorphisms. These data add to growing evidence that even plaque-purified stocks of stable DNA viruses exhibit limited sequence heterogeneity, which likely seeds future strain evolution
    corecore