244 research outputs found

    Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor

    BACKGROUND: Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintaining the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. RESULTS: We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a Java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein-coding regions in sequences; searching and including PubMed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. CONCLUSION: Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at (RepbaseSubmitter) and (Censor)
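
The abstract above mentions automated calculation of sequence lengths and composition. As a minimal sketch of what such a calculation involves (this is not RepbaseSubmitter's code; the function name `sequence_stats` and the returned field names are hypothetical), one could count bases like this:

```python
from collections import Counter


def sequence_stats(seq: str) -> dict:
    """Return the length and base counts of a DNA sequence.

    Hypothetical helper for illustration only. Whitespace is ignored,
    case is ignored, and any symbol other than A, C, G, T is counted
    under "other".
    """
    cleaned = "".join(seq.split()).upper()
    counts = Counter(cleaned)
    known = sum(counts[base] for base in "ACGT")
    return {
        "length": len(cleaned),
        "A": counts["A"],
        "C": counts["C"],
        "G": counts["G"],
        "T": counts["T"],
        "other": len(cleaned) - known,
    }


if __name__ == "__main__":
    print(sequence_stats("acgtn ACGT"))
```

Counting ambiguous symbols separately, rather than rejecting them, is a design choice made here for the sketch; a curated database may apply stricter validation.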

    Domestic chickens activate a piRNA defense against avian leukosis virus

    PIWI-interacting RNAs (piRNAs) protect the germ line by targeting transposable elements (TEs) through base-pair complementarity. We do not know how piRNAs co-evolve with TEs in chickens. Here we report that all active TEs in the chicken germ line are targeted by piRNAs, and as TEs lose their activity, the corresponding piRNAs erode away. We observed de novo piRNA birth as the host responds to a recent retroviral invasion. Avian leukosis virus (ALV) endogenized prior to chicken domestication, remains infectious, and threatens the poultry industry. Domestic fowl produce piRNAs targeting ALV from one ALV provirus that was known to render its host ALV resistant. This proviral locus does not produce piRNAs in undomesticated wild chickens. Our findings uncover rapid piRNA evolution reflecting contemporary TE activity, identify a new piRNA acquisition modality by activating a pre-existing genomic locus, and extend piRNA defense roles to include the period when endogenous retroviruses are still infectious. DOI: http://dx.doi.org/10.7554/eLife.24695.00

    Endogenous Avian Leukosis Virus subgroup E elements of the chicken reference genome

    The chicken reference genome contains two endogenous Avian Leukosis Virus subgroup E (ALVE) insertions, but gaps and unresolved repetitive sequences in previous assemblies have hindered their precise characterisation. Detailed analysis of the most recent reference genome (GRCg6a) now shows both ALVEs within contiguous chromosome assemblies for the first time. ALVE6 (ALVE-JFevA) and ALVE-JFevB are both located on chromosome 1, with ALVE6 close to the p arm telomere. ALVE-JFevB is a structurally intact element containing the ALVE gag, pol and env genes, and is capable of forming replication-competent viruses. In contrast, ALVE6 (ALVE-JFevA) contains a 3352 bp 5’ truncation and lacks the entire 5’ LTR and gag gene. Despite this, ALVE6 remains able to produce intact envelope protein, likely due to a mutation in the recognition site for a known inhibitory miRNA (miR-155). Whole genome resequencing datasets from layers, broilers and three independent sources of wild-caught red junglefowl were surveyed for the presence of each of these reference genome ALVEs. ALVE-JFevB was found in no other chicken or red junglefowl genomes, whereas ALVE6 was identified in some layers, broilers and native breeds, but not within any other red junglefowl genome. Improved assembly contiguity has facilitated better characterisation of the two ALVEs of the chicken reference genome. However, both the limited ALVE content and the unique presence of ALVE-JFevB suggest that the reference individual is unrepresentative of ancestral Gallus gallus ALVE diversity.

    Genome-wide evidence for local DNA methylation spreading from small RNA-targeted sequences in Arabidopsis

    Transposable elements (TEs) and their relics play major roles in genome evolution. However, mobilization of TEs is usually deleterious and strongly repressed. In plants and mammals, this repression is typically associated with DNA methylation, but the relationship between this epigenetic mark and TE sequences has not been investigated systematically. Here, we present an improved annotation of TE sequences and use it to analyze genome-wide DNA methylation maps obtained at single-nucleotide resolution in Arabidopsis. We show that although the majority of TE sequences are methylated, ∼26% are not. Moreover, a significant fraction of TE sequences densely methylated at CG, CHG and CHH sites (where H = A, T or C) have no or few matching small interfering RNA (siRNAs) and are therefore unlikely to be targeted by the RNA-directed DNA methylation (RdDM) machinery. We provide evidence that these TE sequences acquire DNA methylation through spreading from adjacent siRNA-targeted regions. Further, we show that although both methylated and unmethylated TE sequences located in euchromatin tend to be more abundant closer to genes, this trend is least pronounced for methylated, siRNA-targeted TE sequences located 5′ to genes. Based on these and other findings, we propose that spreading of DNA methylation through promoter regions explains at least in part the negative impact of siRNA-targeted TE sequences on neighboring gene expression

    Web services at the European Bioinformatics Institute-2009

    The European Bioinformatics Institute (EMBL-EBI) has been providing access to mainstream databases and tools in bioinformatics since 1997. In addition to the traditional web form based interfaces, APIs exist for core data resources such as EMBL-Bank, Ensembl, UniProt, InterPro, PDB and ArrayExpress. These APIs are based on Web Services (SOAP/REST) interfaces that allow users to systematically access databases and analytical tools. From the user's point of view, these Web Services provide the same functionality as the browser-based forms. However, the APIs free the user from web page constraints and are ideal for the analysis of large batches of data, performing text-mining tasks and the casual or systematic evaluation of mathematical models in regulatory networks. Furthermore, these services are widespread and easy to use, requiring no prior knowledge of the technology and no more than basic experience in programming. In the following we report on new and updated services and briefly describe planned developments to be made available during the course of 2009–2010.

    Fine-grained annotation and classification of de novo predicted LTR retrotransposons

    Long terminal repeat (LTR) retrotransposons and endogenous retroviruses (ERVs) are transposable elements in eukaryotic genomes well suited for computational identification. De novo identification tools determine the position of potential LTR retrotransposon or ERV insertions in genomic sequences. For further analysis, it is desirable to obtain an annotation of the internal structure of such candidates. This article presents LTRdigest, a novel software tool for automated annotation of internal features of putative LTR retrotransposons. It uses local alignment and hidden Markov model-based algorithms to detect retrotransposon-associated protein domains as well as primer binding sites and polypurine tracts. As an example, we used LTRdigest results to identify 88 (near) full-length ERVs in the chromosome 4 sequence of Mus musculus, separating them from truncated insertions and other repeats. Furthermore, we propose a work flow for the use of LTRdigest in de novo LTR retrotransposon classification and perform an exemplary de novo analysis on the Drosophila melanogaster genome as a proof of concept. Using a new method solely based on the annotations generated by LTRdigest, 518 potential LTR retrotransposons were automatically assigned to 62 candidate groups. Representative sequences from 41 of these 62 groups were matched to reference sequences with >80% global sequence similarity

    A First Glimpse of Wild Lupin Karyotype Variation As Revealed by Comparative Cytogenetic Mapping

    Insight into plant genomes at the cytomolecular level provides useful information about their karyotype structure, enabling inferences about taxonomic relationships and evolutionary origins. The Old World lupins (OWL) demonstrate a high level of genomic diversification involving variation in chromosome numbers (2n = 32-52), basic chromosome numbers (x = 5-7, 9, 13) and in nuclear genome size (2C DNA = 0.97-2.68 pg). Lupins comprise both crop and wild species and provide an intriguing system to study karyotype evolution. In order to investigate lupin chromosome structure, heterologous FISH was used. Sixteen BACs that had been generated as chromosome markers for the reference species, Lupinus angustifolius, were used to identify chromosomes in the wild species and explore karyotype variation. While all were “single-locus” in L. angustifolius, in the wild lupins these clones proved to be “single-locus,” “single-locus” with additional signals, “repetitive” or had no detectable BAC-FISH signal. The diverse distribution of the clones in the targeted genomes suggests a complex evolutionary history, which possibly involved multiple chromosomal changes such as fusions/fissions and repetitive sequence amplification. Twelve BACs were sequenced and we found numerous transposable elements including DNA transposons as well as LTR and non-LTR retrotransposons with varying quantity and composition among the different lupin species. However, at this preliminary stage, no correlation was observed between the pattern of BAC-FISH signals and the repeat content in particular BACs. Here, we describe the first BAC-based chromosome-specific markers for the wild species: L. cosentinii, L. cryptanthus, L. pilosus, L. micranthus and one New World lupin, L. multiflorus. These BACs could constitute the basis for an assignment of the chromosomal and genetic maps of other lupins, e.g., L. albus and L. luteus. Moreover, we identified karyotype variation that helps illustrate the relationships between the lupins and the extensive cytological diversity within this group. In this study we propose that lupin genomes underwent at least two rounds of fusion and fission events resulting in the reduction in chromosome number from 2n = 52 through 2n = 40 to 2n = 32, followed by chromosome number increment to 2n = 42.

    The genomes of two key bumblebee species with primitive eusocial organization

    Background: The shift from solitary to social behavior is one of the major evolutionary transitions. Primitively eusocial bumblebees are uniquely placed to illuminate the evolution of highly eusocial insect societies. Bumblebees are also invaluable natural and agricultural pollinators, and there is widespread concern over recent population declines in some species. High-quality genomic data will inform key aspects of bumblebee biology, including susceptibility to implicated population viability threats. Results: We report the high quality draft genome sequences of Bombus terrestris and Bombus impatiens, two ecologically dominant bumblebees and widely utilized study species. Comparing these new genomes to those of the highly eusocial honeybee Apis mellifera and other Hymenoptera, we identify deeply conserved similarities, as well as novelties key to the biology of these organisms. Some honeybee genome features thought to underpin advanced eusociality are also present in bumblebees, indicating an earlier evolution in the bee lineage. Xenobiotic detoxification and immune genes are similarly depauperate in bumblebees and honeybees, and multiple categories of genes linked to social organization, including development and behavior, show high conservation. Key differences identified include a bias in bumblebee chemoreception towards gustation from olfaction, and striking differences in microRNAs, potentially responsible for gene regulation underlying social and other traits. Conclusions: These two bumblebee genomes provide a foundation for post-genomic research on these key pollinators and insect societies. Overall, gene repertoires suggest that the route to advanced eusociality in bees was mediated by many small changes in many genes and processes, and not by notable expansion or depauperation

    No Evidence for Natural Selection on Endogenous Borna-Like Nucleoprotein Elements after the Divergence of Old World and New World Monkeys

    Endogenous Borna-like nucleoprotein elements (EBLNs) were recently discovered as non-retroviral RNA virus elements derived from bornavirus in the genomes of various animals. Most EBLNs appear to be defective, but some of the primate EBLN-1 to -4 elements, which appear to have originated from four independent integrations of the bornavirus nucleoprotein (N) gene, have retained an open reading frame (ORF) for more than 40 million years. It was therefore possible that primate EBLNs have encoded functional proteins during evolution. To examine this possibility, natural selection operating on all ORFs of primate EBLN-1 to -4 was examined by comparing the rates of synonymous and nonsynonymous substitutions. The expected number of premature termination codons in EBLN-1 generated after the divergence of Old World and New World monkeys under selective neutrality was also examined by Monte Carlo simulation. As a result, natural selection was detected neither for the entire region nor for parts of the ORFs in the pairwise analysis of primate EBLN-1 to -4, nor for any branch of the phylogenetic trees for EBLN-1 to -4 after the divergence of Old World and New World monkeys. Computer simulation also indicated that the absence of premature termination codons in the present-day EBLN-1 does not necessarily support the maintenance of function after the divergence of Old World and New World monkeys. These results suggest that EBLNs have not generally encoded functional proteins after the divergence of Old World and New World monkeys.
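
The Monte Carlo test described in this abstract asks how often an ORF would stay free of premature termination codons if substitutions accumulated without selection. The sketch below illustrates that idea only; it is not the authors' procedure. It assumes a uniform substitution model (every site and every replacement base equally likely), an uppercase ORF whose length is a multiple of three, and hypothetical function names throughout.

```python
import random

STOP_CODONS = {"TAA", "TAG", "TGA"}


def count_premature_stops(orf: str) -> int:
    """Count in-frame stop codons, ignoring the final codon of the ORF.

    Assumes len(orf) is a multiple of 3 and the sequence is uppercase.
    """
    codons = [orf[i:i + 3] for i in range(0, len(orf) - 3, 3)]
    return sum(codon in STOP_CODONS for codon in codons)


def mutate(orf: str, n_subs: int, rng: random.Random) -> str:
    """Apply n_subs random single-base substitutions.

    Assumption: sites and replacement bases are chosen uniformly, which
    is a simplification of any realistic neutral substitution model.
    """
    bases = list(orf)
    for _ in range(n_subs):
        pos = rng.randrange(len(bases))
        bases[pos] = rng.choice([b for b in "ACGT" if b != bases[pos]])
    return "".join(bases)


def fraction_without_stops(orf: str, n_subs: int,
                           n_trials: int = 1000, seed: int = 0) -> float:
    """Estimate the probability that the ORF has no premature stop codon
    after n_subs neutral substitutions."""
    rng = random.Random(seed)
    intact = sum(
        count_premature_stops(mutate(orf, n_subs, rng)) == 0
        for _ in range(n_trials)
    )
    return intact / n_trials


if __name__ == "__main__":
    toy_orf = "ATGGCCGCCAAAGGGTAA"  # made-up sequence, for illustration only
    print(fraction_without_stops(toy_orf, n_subs=5))
```

If the estimated fraction is not small, an intact present-day ORF is compatible with neutral evolution, which is the logic behind the abstract's conclusion that an intact ORF alone does not demonstrate maintained function.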