22,404 research outputs found

    A Family of Developmentally Excised DNA Elements in \u3cem\u3eTetrahymena\u3c/em\u3e is under Selective Pressure to Maintain an Open Reading Frame Encoding an Integrase-Like Protein

    Get PDF
    Tlr1 is a member of a family of ~20-30 DNA elements that undergo developmentally regulated excision during formation of the macronucleus in the ciliated protozoan Tetrahymena. Analysis of sequence internal to the right boundary of Tlr1 revealed the presence of a 2 kb open reading frame (ORF) encoding a deduced protein with similarity to retrotransposon integrases. The ORFs of five unique clones were sequenced. The ORFs have 98% sequence conservation and align without frameshifts, although one has an additional trinucleotide at codon 561. Nucleotide changes among the five clones are highly non-random with respect to the position in the codon and 93% of the nucleotide changes among the five clones encode identical or similar amino acids, suggesting that the ORF has evolved under selective pressure to preserve a functional protein. Nineteen TIC transitions in T/CAA and T/CAG codons suggest selection has occurred in the context of the Tetrahymena genome, where TAA and TAG encode Gin. Similarities between the ORF and those encoding retrotransposon integrases suggest that the Tlr family of elements may encode a polynucleotide transferase. Possible roles for the protein in transposition of the elements within the micronuclear genome and/or their developmentally regulated excision from the macronucleus are discussed

    Meningococcal genetic variation mechanisms viewed through comparative analysis of Serogroup C strain FAM18

    Get PDF
    Copyright @ 2007 Public Library of ScienceThe bacterium Neisseria meningitidis is commonly found harmlessly colonising the mucosal surfaces of the human nasopharynx. Occasionally strains can invade host tissues causing septicaemia and meningitis, making the bacterium a major cause of morbidity and mortality in both the developed and developing world. The species is known to be diverse in many ways, as a product of its natural transformability and of a range of recombination and mutation-based systems. Previous work on pathogenic Neisseria has identified several mechanisms for the generation of diversity of surface structures, including phase variation based on slippage-like mechanisms and sequence conversion of expressed genes using information from silent loci. Comparison of the genome sequences of two N. meningitidis strains, serogroup B MC58 and serogroup A Z2491, suggested further mechanisms of variation, including C-terminal exchange in specific genes and enhanced localised recombination and variation related to repeat arrays. We have sequenced the genome of N. meningitidis strain FAM18, a representative of the ST-11/ET-37 complex, providing the first genome sequence for the disease-causing serogroup C meningococci; it has 1,976 predicted genes, of which 60 do not have orthologues in the previously sequenced serogroup A or B strains. Through genome comparison with Z2491 and MC58 we have further characterised specific mechanisms of genetic variation in N. meningitidis, describing specialised loci for generation of cell surface protein variants and measuring the association between noncoding repeat arrays and sequence variation in flanking genes. Here we provide a detailed view of novel genetic diversification mechanisms in N. meningitidis. Our analysis provides evidence for the hypothesis that the noncoding repeat arrays in neisserial genomes (neisserial intergenic mosaic elements) provide a crucial mechanism for the generation of surface antigen variants. Such variation will have an impact on the interaction with the host tissues, and understanding these mechanisms is important to aid our understanding of the intimate and complex relationship between the human nasopharynx and the meningococcus.This work was supported by the Wellcome Trust through the Beowulf Genomics Initiative

    PinR mediates the generation of reversible population diversity in Streptococcus zooepidemicus

    Get PDF
    Opportunistic pathogens must adapt to and survive in a wide range of complex ecosystems. Streptococcus zooepidemicus is an opportunistic pathogen of horses and many other animals, including humans. The assembly of different surface architecture phenotypes from one genotype is likely to be crucial to the successful exploitation of such an opportunistic lifestyle. Construction of a series of mutants revealed that a serine recombinase, PinR, inverts 114 bp of the promoter of SZO_08560, which is bordered by GTAGACTTTA and TAAAGTCTAC inverted repeats. Inversion acts as a switch, controlling the transcription of this sortase-processed protein, which may enhance the attachment of S. zooepidemicus to equine trachea. The genome of a recently sequenced strain of S. zooepidemicus, 2329 (Sz2329), was found to contain a disruptive internal inversion of 7 kb of the FimIV pilus locus, which is bordered by TAGAAA and TTTCTA inverted repeats. This strain lacks pinR and this inversion may have become irreversible following the loss of this recombinase. Active inversion of FimIV was detected in three strains of S. zooepidemicus, 1770 (Sz1770), B260863 (SzB260863) and H050840501 (SzH050840501), all of which encoded pinR. A deletion mutant of Sz1770 that lacked pinR was no longer capable of inverting its internal region of FimIV. The data highlight redundancy in the PinR sequence recognition motif around a short TAGA consensus and suggest that PinR can reversibly influence the wider surface architecture of S. zooepidemicus, providing this organism with a bet-hedging solution to survival in fluctuating environments

    Precise sequence complementarity between yeast chromosome ends and two classes of just-subtelomeric sequences

    Get PDF
    The terminal regions (last 20 kb) of Saccharomyces cerevisiae chromosomes universally contain blocks of precise sequence similarity to other chromosome terminal regions. The left and right terminal regions are distinct in the sense that the sequence similarities between them are reverse complements. Direct sequence similarity occurs between the left terminal regions and also between the right terminal regions, but not between any left ends and right ends. With minor exceptions the relationships range from 80% to 100% match within blocks. The regions of similarity are composites of familiar and unfamiliar repeated sequences as well as what could be considered "single-copy" (or better "two-copy") sequences. All terminal regions were compared with all other chromosomes, forward and reverse complement, and 768 comparisons are diagrammed. It appears there has been an extensive history of sequence exchange or copying between terminal regions. The subtelomeric sequences fall into two classes. Seventeen of the chromosome ends terminate with the Y' repeat, while 15 end with the 800-nt "X2" repeats just adjacent to the telomerase simple repeats. The just-subterminal repeats are very similar to each other except that chromosome 1 right end is more divergent

    Diverse Sequences within Tlr Elements Target Programmed DNA Elimination in \u3cem\u3eTetrahymena Thermophila\u3c/em\u3e

    Get PDF
    Tlr elements are a novel family of ~30 putative mobile genetic elements that are conïŹned to the germ line micronuclear genome in Tetrahymena thermophila. Thousands of diverse germ line-limited sequences, including the Tlr elements, are speciïŹcally eliminated from the differentiating somatic macronucleus. Macronucleusretained sequences ïŹ‚anking deleted regions are known to contain cis-acting signals that delineate elimination boundaries. It is unclear whether sequences within deleted DNA also play a regulatory role in the elimination process. In the current study, an in vivo DNA rearrangement assay was used to identify internal sequences required in cis for the elimination of Tlr elements. Multiple, nonoverlapping regions from the ~23-kb Tlr elements were independently sufïŹcient to stimulate developmentally regulated DNA elimination when placed within the context of ïŹ‚anking sequences from the most thoroughly characterized family member, Tlr1. Replacement of element DNA with macronuclear or foreign DNA abolished elimination activity. Thus, diverse sequences dispersed throughout Tlr DNA contain cis-acting signals that target these elements for programmed elimination. Surprisingly, Tlr DNA was also efïŹciently deleted when Tlr1 ïŹ‚anking sequences were replaced with DNA from a region of the genome that is not normally associated with rearrangement, suggesting that speciïŹc ïŹ‚anking sequences are not required for the elimination of Tlr element DNA

    Genome sequence of canine herpesvirus

    Get PDF
    Canine herpesvirus is a widespread alphaherpesvirus that causes a fatal haemorrhagic disease of neonatal puppies. We have used high-throughput methods to determine the genome sequences of three viral strains (0194, V777 and V1154) isolated in the United Kingdom between 1985 and 2000. The sequences are very closely related to each other. The canine herpesvirus genome is estimated to be 125 kbp in size and consists of a unique long sequence (97.5 kbp) and a unique short sequence (7.7 kbp) that are each flanked by terminal and internal inverted repeats (38 bp and 10.0 kbp, respectively). The overall nucleotide composition is 31.6% G+C, which is the lowest among the completely sequenced alphaherpesviruses. The genome contains 76 open reading frames predicted to encode functional proteins, all of which have counterparts in other alphaherpesviruses. The availability of the sequences will facilitate future research on the diagnosis and treatment of canine herpesvirus-associated disease

    The CACTA transposon Bot1 played a major role in Brassica genome divergence and gene proliferation

    Get PDF
    We isolated and characterized a Brassica C genome-specific CACTA element, which was designated Bot1 (Brassica oleracea transposon 1). After analysing phylogenetic relationships, copy numbers and sequence similarity of Bot1 and Bot1 analogues in B. oleracea (C genome) versus Brassica rapa (A genome), we concluded that Bot1 has encountered several rounds of amplification in the oleracea genome only, and has played a major role in the recent rapa and oleracea genome divergence. We performed in silico analyses of the genomic organization and internal structure of Bot1, and established which segment of Bot1 is C-genome specific. Our work reports a fully characterized Brassica repetitive sequence that can distinguish the Brassica A and C chromosomes in the allotetraploid Brassica napus, by fluorescent in situ hybridization. We demonstrated that Bot1 carries a host S locus-associated SLL3 gene copy. We speculate that Bot1 was involved in the proliferation of SLL3 around the Brassica genome. The present study reinforces the assumption that transposons are a major driver of genome and gene evolution in higher plants

    A genome-wide survey of segmental duplications that mediate common human genetic variation of chromosomal architecture.

    Get PDF
    Recent studies have identified a small number of genomic rearrangements that occur frequently in the general population. Bioinformatics tools are now available for systematic genome-wide surveys of higher-order structures predisposing to such common variations in genomic architecture. Segmental duplications (SDs) constitute up to 5 per cent of the genome and play an important role in generating additional rearrangements and in disease aetiology. We conducted a genome-wide database search for a form of SD, palindromic segmental duplications (PSDs), which consist of paired, inverted duplications, and which predispose to inversions, duplications and deletions. The survey was complemented by a search for SDs in tandem orientation (TSDs) that can mediate duplications and deletions but not inversions. We found more than 230 distinct loci with higher-order genomic structure that can mediate genomic variation, of these about 180 contained a PSD. A number of these sites were previously identified as harbouring common inversions or as being associated with specific genomic diseases characterised by duplication, deletions or inversions. Most of the regions, however, were previously unidentified; their characterisation should identify further common rearrangements and may indicate localisations for additional genomic disorders. The widespread distribution of complex chromosomal architecture suggests a potentially high degree of plasticity of the human genome and could uncover another level of genetic variation within human populations

    Characterization of Toxoplasma gondii subtelomeric-like regions: identification of a long-range compositional bias that is also associated with gene-poor regions

    Get PDF
    Background Chromosome ends are composed of telomeric repeats and subtelomeric regions, which are patchworks of genes interspersed with repeated elements. Although chromosome ends display similar arrangements in different species, their sequences are highly divergent. In addition, these regions display a particular nucleosomal composition and bind specific factors, therefore producing a special kind of heterochromatin. Using data from currently available draft genomes we have characterized these putative Telomeric Associated Sequences in Toxoplasma gondii. Results An all-vs-all pairwise comparison of T. gondii assembled chromosomes revealed the presence of conserved regions of ∌ 30 Kb located near the ends of 9 of the 14 chromosomes of the genome of the ME49 strain. Sequence similarity among these regions is ∌ 70%, and they are also highly conserved in the GT1 and VEG strains. However, they are unique to Toxoplasma with no detectable similarity in other Apicomplexan parasites. The internal structure of these sequences consists of 3 repetitive regions separated by high-complexity sequences without annotated genes, except for a gene from the Toxoplasma Specific Family. ChIP-qPCR experiments showed that nucleosomes associated to these sequences are enriched in histone H4 monomethylated at K20 (H4K20me1), and the histone variant H2A.X, suggesting that they are silenced sequences (heterochromatin). A detailed characterization of the base composition of these sequences, led us to identify a strong long-range compositional bias, which was similar to that observed in other genomic silenced fragments such as those containing centromeric sequences, and was negatively correlated to gene density. Conclusions We identified and characterized a region present in most Toxoplasma assembled chromosomes. Based on their location, sequence features, and nucleosomal markers we propose that these might be part of subtelomeric regions of T. gondii. The identified regions display a unique trinucleotide compositional bias, which is shared (despite the lack of any detectable sequence similarity) with other silenced sequences, such as those making up the chromosome centromeres. We also identified other genomic regions with this compositional bias (but no detectable sequence similarity) that might be functionally similar.Fil: Dalmasso, Maria Carolina. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas. Centro CientĂ­fico TecnolĂłgico Conicet - La Plata. Instituto de Investigaciones BiotecnolĂłgicas. Instituto de Investigaciones BiotecnolĂłgicas "Dr. RaĂșl AlfonsĂ­n" (sede ChascomĂșs). Universidad Nacional de San MartĂ­n. Instituto de Investigaciones BiotecnolĂłgicas. Instituto de Investigaciones BiotecnolĂłgicas "Dr. RaĂșl AlfonsĂ­n" (sede ChascomĂșs); ArgentinaFil: Carmona, Santiago Javier. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas. Centro CientĂ­fico TecnolĂłgico Conicet - La Plata. Instituto de Investigaciones BiotecnolĂłgicas. Instituto de Investigaciones BiotecnolĂłgicas "Dr. RaĂșl AlfonsĂ­n" (sede ChascomĂșs). Universidad Nacional de San MartĂ­n. Instituto de Investigaciones BiotecnolĂłgicas. Instituto de Investigaciones BiotecnolĂłgicas "Dr. RaĂșl AlfonsĂ­n" (sede ChascomĂșs); ArgentinaFil: Ángel, Sergio Oscar. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas. Centro CientĂ­fico TecnolĂłgico Conicet - La Plata. Instituto de Investigaciones BiotecnolĂłgicas. Universidad Nacional de San MartĂ­n. Instituto de Investigaciones BiotecnolĂłgicas; ArgentinaFil: AgĂŒero, Fernan Gonzalo. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas. Centro CientĂ­fico TecnolĂłgico Conicet - La Plata. Instituto de Investigaciones BiotecnolĂłgicas. Universidad Nacional de San MartĂ­n. Instituto de Investigaciones BiotecnolĂłgicas; Argentin

    Potent CRISPR-Cas9 inhibitors from Staphylococcus genomes.

    Get PDF
    Anti-CRISPRs (Acrs) are small proteins that inhibit the RNA-guided DNA targeting activity of CRISPR-Cas enzymes. Encoded by bacteriophage and phage-derived bacterial genes, Acrs prevent CRISPR-mediated inhibition of phage infection and can also block CRISPR-Cas-mediated genome editing in eukaryotic cells. To identify Acrs capable of inhibiting Staphylococcus aureus Cas9 (SauCas9), an alternative to the most commonly used genome editing protein Streptococcus pyogenes Cas9 (SpyCas9), we used both self-targeting CRISPR screening and guilt-by-association genomic search strategies. Here we describe three potent inhibitors of SauCas9 that we name AcrIIA13, AcrIIA14, and AcrIIA15. These inhibitors share a conserved N-terminal sequence that is dispensable for DNA cleavage inhibition and have divergent C termini that are required in each case for inhibition of SauCas9-catalyzed DNA cleavage. In human cells, we observe robust inhibition of SauCas9-induced genome editing by AcrIIA13 and moderate inhibition by AcrIIA14 and AcrIIA15. We also find that the conserved N-terminal domain of AcrIIA13-AcrIIA15 binds to an inverted repeat sequence in the promoter of these Acr genes, consistent with its predicted helix-turn-helix DNA binding structure. These data demonstrate an effective strategy for Acr discovery and establish AcrIIA13-AcrIIA15 as unique bifunctional inhibitors of SauCas9
    • 

    corecore