114 research outputs found

    LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons

    Background: Transposable elements are abundant in eukaryotic genomes and it is believed that they have a significant impact on the evolution of gene and chromosome structure. While there are several completed eukaryotic genome projects, there are only a few high-quality genome-wide annotations of transposable elements. Therefore, there is a considerable demand for computational identification of transposable elements. LTR retrotransposons, an important subclass of transposable elements, are well suited for computational identification, as they contain long terminal repeats (LTRs). Results: We have developed a software tool, LTRharvest, for the de novo detection of full-length LTR retrotransposons in large sequence sets. LTRharvest efficiently delivers high-quality annotations based on known LTR transposon features like length, distance, and sequence motifs. A quality validation of LTRharvest against a gold standard annotation for Saccharomyces cerevisiae and Drosophila melanogaster shows a sensitivity of up to 90% and 97% and specificity of 100% and 72%, respectively. This is comparable to or slightly better than annotations from previous software tools. The main advantages of LTRharvest over previous tools are (a) its ability to efficiently handle large datasets from finished or unfinished genome projects, (b) its flexibility in incorporating known sequence features into the prediction, and (c) its availability as open source software. Conclusion: LTRharvest is an efficient software tool delivering high-quality annotation of LTR retrotransposons. It can, for example, process the largest human chromosome in approximately 8 minutes on a Linux PC with 4 GB of memory. Its flexibility and small space and run-time requirements make LTRharvest a very competitive candidate for future LTR retrotransposon annotation projects. Moreover, the structured design and implementation and the availability as open source provide an excellent base for incorporating novel concepts to further improve prediction of LTR retrotransposons.

    Fine-grained annotation and classification of de novo predicted LTR retrotransposons

    Long terminal repeat (LTR) retrotransposons and endogenous retroviruses (ERVs) are transposable elements in eukaryotic genomes well suited for computational identification. De novo identification tools determine the position of potential LTR retrotransposon or ERV insertions in genomic sequences. For further analysis, it is desirable to obtain an annotation of the internal structure of such candidates. This article presents LTRdigest, a novel software tool for automated annotation of internal features of putative LTR retrotransposons. It uses local alignment and hidden Markov model-based algorithms to detect retrotransposon-associated protein domains as well as primer binding sites and polypurine tracts. As an example, we used LTRdigest results to identify 88 (near) full-length ERVs in the chromosome 4 sequence of Mus musculus, separating them from truncated insertions and other repeats. Furthermore, we propose a workflow for the use of LTRdigest in de novo LTR retrotransposon classification and perform an exemplary de novo analysis on the Drosophila melanogaster genome as a proof of concept. Using a new method solely based on the annotations generated by LTRdigest, 518 potential LTR retrotransposons were automatically assigned to 62 candidate groups. Representative sequences from 41 of these 62 groups were matched to reference sequences with >80% global sequence similarity.

    Host–Parasite interactions in Entamoeba histolytica and Entamoeba dispar: what have we learned from their genomes?

    Invasive amoebiasis caused by Entamoeba histolytica is a major global health problem. Virulence is a rare outcome of infection, occurring in fewer than 1 in 10 infections. Not all strains of the parasite are equally virulent, and understanding the mechanisms and causes of virulence is an important goal of Entamoeba research. The sequencing of the genomes of E. histolytica and the related avirulent species Entamoeba dispar has allowed whole-genome-scale analyses of genetic divergence and differential gene expression to be undertaken. These studies have helped elucidate mechanisms of virulence and identified genes differentially expressed in virulent and avirulent parasites. Here, we review the current status of the E. histolytica and E. dispar genomes and the findings of a number of genome-scale studies comparing parasites of different virulence.

    Entamoeba Shows Reversible Variation in Ploidy under Different Growth Conditions and between Life Cycle Phases

    Under axenic growth conditions, trophozoites of Entamoeba histolytica contain heterogeneous amounts of DNA due to the presence of both multiple nuclei and different amounts of DNA in individual nuclei. In order to establish whether the DNA content and the observed heterogeneity are maintained under different growth conditions, we have compared E. histolytica cells growing in xenic and axenic cultures. Our results show that the nuclear DNA content of E. histolytica trophozoites growing in axenic cultures is at least 10-fold higher than in xenic cultures. Re-association of axenic cultures with their bacterial flora led to a reduction of DNA content to the original xenic values. Thus switching between xenic and axenic growth conditions was accompanied by significant changes in the nuclear DNA content of this parasite. Changes in DNA content during encystation-excystation were studied in the related reptilian parasite E. invadens. During excystation of E. invadens cysts, it was observed that the nuclear DNA content increased approximately 40-fold following emergence of trophozoites in axenic cultures. Based on the observed large changes in nuclear size and DNA content, and the minor differences in relative abundance of representative protein coding sequences, rDNA and tRNA sequences, it appears that gain or loss of whole genome copies may be occurring during changes in the growth conditions. Our studies demonstrate the inherent plasticity and dynamic nature of the Entamoeba genome in at least two species.

    Sex determination, sex chromosomes and karyotype evolution in insects

    Insects harbor a tremendous diversity of sex determining mechanisms both within and between groups. For example, in some orders such as Hymenoptera, all members are haplodiploid, whereas Diptera contain species with homomorphic as well as male and female heterogametic sex chromosome systems or paternal genome elimination. We have established a large database on karyotypes and sex chromosomes in insects, containing information on over 13,000 species covering 29 orders of insects. This database constitutes a unique starting point to report phylogenetic patterns on the distribution of sex determination mechanisms, sex chromosomes, and karyotypes among insects and allows us to test general theories on the evolutionary dynamics of karyotypes, sex chromosomes, and sex determination systems in a comparative framework. Phylogenetic analysis reveals that male heterogamety is the ancestral mode of sex determination in insects, and transitions to female heterogamety are extremely rare. Many insect orders harbor species with complex sex chromosomes, and gains and losses of the sex-limited chromosome are frequent in some groups. Haplodiploidy originated several times within insects, and parthenogenesis is rare but evolves frequently. Providing a single source to electronically access data previously distributed among more than 500 articles and books will not only accelerate analyses of the assembled data, but also provide a unique resource to guide research on which taxa are likely to be informative to address specific questions, for example, for genome sequencing projects or large-scale comparative studies.

    Differential distribution of a SINE element in the Entamoeba histolytica and Entamoeba dispar genomes: Role of the LINE-encoded endonuclease

    Background: Entamoeba histolytica and Entamoeba dispar are closely related protistan parasites, but while E. histolytica can be invasive, E. dispar is completely nonpathogenic. Transposable elements constitute a significant portion of the genome in these species, with three families of LINEs and SINEs. These elements can profoundly influence the expression of neighboring genes. Thus their genomic location can have important phenotypic consequences. A genome-wide comparison of the location of these elements in the E. histolytica and E. dispar genomes has not been carried out. It is also not known whether the retrotransposition machinery works similarly in both species. The present study was undertaken to address these issues. Results: Here we extracted all genomic occurrences of full-length copies of EhSINE1 in the E. histolytica genome and matched them with the homologous regions in E. dispar, and vice versa, wherever it was possible to establish synteny. We found that only about 20% of syntenic sites were occupied by SINE1 in both species. We checked whether the different genomic location in the two species was due to differences in the activity of the LINE-encoded endonuclease, which is required for nicking the target site. We found that the endonucleases of both species were essentially very similar, both in their kinetic properties and in their substrate sequence specificity. Hence the differential distribution of SINEs in these species is not likely to be influenced by the endonuclease. Further, we found that the physical properties of the DNA sequences adjoining the insertion sites were similar in both species. Conclusions: Our data show that the basic retrotransposition machinery is conserved in these sibling species. SINEs may indeed have occupied all of the insertion sites in the genome of the common ancestor of E. histolytica and E. dispar, but these may have been subsequently lost from some locations. Alternatively, SINE expansion took place after the divergence of the two species. The absence of SINE1 in 80% of syntenic loci could affect the phenotype of the two species, including their pathogenic properties, which needs to be explored.

    Interchromosomal Duplications on the Bactrocera oleae Y Chromosome Imply a Distinct Evolutionary Origin of the Sex Chromosomes Compared to Drosophila

    BACKGROUND: Diptera have an extraordinary variety of sex determination mechanisms, and Drosophila melanogaster is the paradigm for this group. However, the Drosophila sex determination pathway is only partially conserved and the family Tephritidae affords an interesting example. The tephritid Y chromosome is postulated to be necessary to determine male development. Characterization of Y sequences, apart from elucidating the nature of the male determining factor, is also important to understand the evolutionary history of sex chromosomes within the Tephritidae. We studied the Y sequences from the olive fly, Bactrocera oleae. Its Y chromosome is minute and highly heterochromatic, and displays high heteromorphism with the X chromosome. METHODOLOGY/PRINCIPAL FINDINGS: A combined Representational Difference Analysis (RDA) and fluorescence in-situ hybridization (FISH) approach was used to investigate the Y chromosome to derive information on its sequence content. The Y chromosome is strewn with repetitive DNA sequences, the majority of which are also interspersed in the pericentromeric regions of the autosomes. The Y chromosome appears to have accumulated small and large repetitive interchromosomal duplications. The large interchromosomal duplications harbour an importin-4-like gene fragment. Apart from these importin-4-like sequences, the other Y repetitive sequences are not shared with the X chromosome, suggesting molecular differentiation of these two chromosomes. Moreover, as the identified Y sequences were not detected on the Y chromosomes of closely related tephritids, we can infer divergence in the repetitive nature of their sequence contents. CONCLUSIONS/SIGNIFICANCE: The identification of Y-linked sequences may tell us much about the repetitive nature, the origin and the evolution of Y chromosomes. We hypothesize how these repetitive sequences accumulated and were maintained on the Y chromosome during its evolutionary history. Our data reinforce the idea that the sex chromosomes of the Tephritidae may have distinct evolutionary origins with respect to those of the Drosophilidae and other Dipteran families.

    Cytogenetic analysis of three species of Pseudacteon (Diptera, Phoridae) parasitoids of the fire ants using standard and molecular techniques

    Pseudacteon flies, parasitoids of worker ants, are being intensively studied as potentially effective agents in the biological control of the invasive pest fire ant genus Solenopsis (Hymenoptera: Formicidae). This is the first attempt to describe the karyotype of P. curvatus Borgmeier, P. nocens Borgmeier and P. tricuspis Borgmeier. The three species possess 2n = 6; chromosomes I and II were metacentric in the three species, but chromosome pair III was subtelocentric in P. curvatus and P. tricuspis, and telocentric in P. nocens. All three species possess a C positive band in chromosome II, lack C positive heterochromatin on chromosome I, and are mostly differentiated with respect to chromosome III. P. curvatus and P. tricuspis possess a C positive band, but at different locations, whereas this band is absent in P. nocens. Heterochromatic bands are neither AT nor GC rich as revealed by fluorescent banding. In situ hybridization with an 18S rDNA probe revealed a signal on chromosome II in a similar location to the C positive band in the three species. The apparent lack of morphologically distinct sex chromosomes is consistent with proposals of environmental sex determination in the genus. Small differences detected in chromosome length and morphology suggest that chromosomes have been highly conserved during the evolutionary radiation of Pseudacteon. Possible mechanisms of karyotype evolution in the three species are suggested.

    Serum-Dependent Selective Expression of EhTMKB1-9, a Member of Entamoeba histolytica B1 Family of Transmembrane Kinases

    Entamoeba histolytica transmembrane kinases (EhTMKs) can be grouped into six distinct families on the basis of motifs and sequences. Analysis of the E. histolytica genome revealed the presence of 35 EhTMKB1 members on the basis of sequence identity (≥95%). Only six homologs were full length, containing an extracellular domain, a transmembrane segment and an intracellular kinase domain. Reverse transcription followed by polymerase chain reaction (RT-PCR) of the kinase domain was used to generate a library of expressed sequences. Sequencing of randomly picked clones from this library revealed that about 95% of the clones were identical with a single member, EhTMKB1-9, in proliferating cells. On serum starvation, the relative number of EhTMKB1-9 derived sequences decreased with a concomitant increase in the sequences derived from another member, EhTMKB1-18. The change in their relative expression was quantified by real-time PCR. Northern analysis and RNase protection assay were used to study the temporal nature of EhTMKB1-9 expression after serum replenishment of starved cells. The results showed that the expression of EhTMKB1-9 was sinusoidal. Specific transcriptional induction of EhTMKB1-9 upon serum replenishment was further confirmed by reporter gene (luciferase) expression and the upstream sequence responsible for serum responsiveness was identified. EhTMKB1-9 is one of the first examples of an inducible gene in Entamoeba. The protein encoded by this member was functionally characterized. The recombinant kinase domain of EhTMKB1-9 displayed protein kinase activity. It is likely to have dual specificity as judged from its sensitivity to different kinase inhibitors. Immuno-localization showed EhTMKB1-9 to be a surface protein which decreased on serum starvation and was relocalized on serum replenishment. Cell lines expressing either EhTMKB1-9 without kinase domain, or EhTMKB1-9 antisense RNA, showed decreased cellular proliferation and target cell killing. Our results suggest that E. histolytica TMKs of B1 family are functional kinases likely to be involved in serum response and cellular proliferation.