Turning gold into 'junk': transposable elements utilize central proteins of cellular networks
The numerous discovered cases of domesticated transposable element (TE) proteins led to the recognition that TEs are a significant source of evolutionary innovation. However, much less is known about the reverse process, whether and to what degree the evolution of TEs is influenced by the genome of their hosts. We addressed this issue by searching for cases of incorporation of host genes into the sequence of TEs and examined the systems-level properties of these genes using the Saccharomyces cerevisiae and Drosophila melanogaster genomes. We identified 51 cases where the evolutionary scenario was the incorporation of a host gene fragment into a TE consensus sequence, and we show that both the yeast and fly homologues of the incorporated protein sequences have central positions in the cellular networks. An analysis of selective pressure (Ka/Ks ratio) detected significant selection in 37% of the cases. Recent research on retrovirus-host interactions shows that virus proteins preferentially target hubs of the host interaction networks, enabling them to take over the host cell using only a few proteins. We propose that TEs face a similar evolutionary pressure to evolve proteins with high interacting capacities and take some of the necessary protein domains directly from their hosts.
Structure Prediction and Analysis of DNA Transposon and LINE Retrotransposon Proteins
Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology
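The abstract above does not say which definition of contact order was used. A minimal sketch, assuming the commonly used relative contact order (the mean sequence separation between contacting residues, divided by chain length), could look like this. The function name and the toy contact list are hypothetical and are not taken from the paper.

```python
def relative_contact_order(contacts, chain_length):
    """Relative contact order: (1 / (L * N)) * sum(|i - j|) over N contacts.

    contacts      -- iterable of (i, j) residue index pairs that are in contact
    chain_length  -- number of residues L in the protein chain
    """
    contacts = list(contacts)
    if not contacts or chain_length <= 0:
        raise ValueError("need at least one contact and a positive chain length")
    total_separation = sum(abs(i - j) for i, j in contacts)
    return total_separation / (chain_length * len(contacts))


# Toy example (made-up contacts for a hypothetical 10-residue chain).
toy_contacts = [(1, 5), (2, 10), (3, 4)]
toy_rco = relative_contact_order(toy_contacts, chain_length=10)
```

Lower values mean that contacts are mostly local in sequence, which is the property the abstract associates with rapid folding.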
Evolutionary History of Mammalian Transposons Determined by Genome-Wide Defragmentation
The constant bombardment of mammalian genomes by transposable elements (TEs) has resulted in TEs comprising at least 45% of the human genome. Because of their great age and abundance, TEs are important in comparative phylogenomics. However, estimates of TE age were previously based on divergence from derived consensus sequences or phylogenetic analysis, which can be unreliable, especially for older more diverged elements. Therefore, a novel genome-wide analysis of TE organization and fragmentation was performed to estimate TE age independently of sequence composition and divergence or the assumption of a constant molecular clock. Analysis of TEs in the human genome revealed ∼600,000 examples where TEs have transposed into and fragmented other TEs, covering >40% of all TEs or ∼542 Mbp of genomic sequence. The relative age of these TEs over evolutionary time is implicit in their organization, because newer TEs have necessarily transposed into older TEs that were already present. A matrix of the number of times that each TE has transposed into every other TE was constructed, and a novel objective function was developed that derived the chronological order and relative ages of human TEs spanning >100 million years. This method has been used to infer the relative ages across all four major TE classes, including the oldest, most diverged elements. Analysis of DNA transposons over the history of the human genome has revealed the early activity of some MER2 transposons, and the relatively recent activity of MER1 transposons during primate lineages. The TEs from six additional mammalian genomes were defragmented and analyzed. Pairwise comparison of the independent chronological orders of TEs in these mammalian genomes revealed species phylogeny, the fact that transposons shared between genomes are older than species-specific transposons, and a subset of TEs that were potentially active during periods of speciation
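The ordering idea in the abstract above can be illustrated with a small sketch. It assumes a simple objective, the number of insertions that contradict a proposed age order, and minimizes it by brute force. This is not the objective function developed in the paper, which the abstract does not specify. The family names and counts are invented for illustration.

```python
from itertools import permutations


def contradictions(order, counts):
    """Count insertions that contradict `order` (listed oldest first).

    counts[a][b] is the number of times family `a` was found inserted into
    family `b`. An insertion is consistent when `a` is younger than `b`.
    """
    rank = {name: pos for pos, name in enumerate(order)}
    total = 0
    for inserted, hosts in counts.items():
        for host, n in hosts.items():
            if rank[inserted] < rank[host]:  # "older" element inside a "younger" one
                total += n
    return total


def best_order(counts):
    """Exhaustive search over orderings; only feasible for a handful of families."""
    families = sorted(counts)
    return min(permutations(families), key=lambda order: contradictions(order, counts))


# Hypothetical interruption matrix for three made-up families.
toy_counts = {
    "A": {"A": 0, "B": 1, "C": 0},
    "B": {"A": 12, "B": 0, "C": 2},
    "C": {"A": 20, "B": 9, "C": 0},
}
toy_order = best_order(toy_counts)
```

Exhaustive search grows factorially with the number of families, so a genome-scale analysis would need a heuristic or an optimization method rather than this enumeration.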
Analysis of Transposon Interruptions Suggests Selection for L1 Elements on the X Chromosome
It has been hypothesised that the massive accumulation of L1 transposable elements on the X chromosome is due to their function in X inactivation, and that the accumulation of Alu elements near genes is adaptive. We tested the possible selective advantage of these two transposable element (TE) families with a novel method, interruption analysis. In mammalian genomes, a large number of TEs interrupt other TEs due to the high overall abundance and age of repeats, and these interruptions can be used to test whether TEs are selectively neutral. Interruptions of TEs that are beneficial for the host are expected to be deleterious and therefore underrepresented compared with interruptions of neutral TEs. We found that L1 elements in the regions of the X chromosome that contain the majority of the inactivated genes are significantly less frequently interrupted than on the autosomes, while L1s near genes that escape inactivation are interrupted with higher frequency, supporting the hypothesis that L1s on the X chromosome play a role in its inactivation. In addition, we show that TEs are less frequently interrupted in introns than in intergenic regions, probably due to selection against the expansion of introns, but the insertion pattern of Alus is comparable to other repeats.
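The comparison at the core of interruption analysis can be sketched as a test on two interruption frequencies. The abstract above does not state which statistical test was used. The example below assumes a pooled two-proportion z-test, and all counts are hypothetical.

```python
import math


def two_proportion_z_test(hits_a, total_a, hits_b, total_b):
    """Pooled two-proportion z-test; returns (z, two-sided p-value).

    hits_*  -- number of interrupted elements in each region
    total_* -- number of elements examined in each region
    """
    p_a = hits_a / total_a
    p_b = hits_b / total_b
    pooled = (hits_a + hits_b) / (total_a + total_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    if se == 0:
        raise ValueError("standard error is zero; proportions are degenerate")
    z = (p_a - p_b) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail
    return z, p_value


# Made-up counts: interrupted elements in region A versus region B.
z_score, p_value = two_proportion_z_test(30, 1000, 60, 1000)
```

A negative z-score here means the first region is interrupted less often than the second, which is the direction the abstract reports for L1s in the inactivated regions of the X chromosome relative to the autosomes.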
Alpha helices are more robust to mutations than beta strands
The rapidly increasing amount of data on human genetic variation has resulted in a growing demand to identify pathogenic mutations computationally, as their experimental validation is currently beyond reach. Here we show that alpha helices and beta strands differ significantly in their ability to tolerate mutations: helices can accumulate more mutations than strands without structural change, due to the higher numbers of inter-residue contacts in helices. This results in two patterns: a) the same number of mutations causes less structural change in helices than in strands; b) helices diverge more rapidly in sequence than strands within the same domains. Additionally, both helices and strands are significantly more robust than coils. Based on this observation, we show that human missense mutations that change secondary structure are more likely to be pathogenic than those that do not. Moreover, including predicted secondary structure changes significantly improves state-of-the-art pathogenicity predictions.
Structural determinants of Sleeping Beauty transposase activity
Transposases are important tools in genome engineering, and there is considerable interest in engineering more efficient ones. Here, we seek to understand the factors determining their activity using the Sleeping Beauty transposase. Recent work suggests that protein coevolutionary information can be used to classify groups of physically connected, coevolving residues into elements called "sectors", which have proven useful for understanding the folding, allosteric interactions, and enzymatic activity of proteins. Using extensive mutagenesis data, protein modeling and analysis of folding energies, we show that (i) the Sleeping Beauty transposase contains two sectors, which span conserved domains and are enriched in DNA-binding residues, indicating that the DNA-binding and endonuclease functions of the transposase coevolve; (ii) sector residues are highly sensitive to mutations, and most mutations of these residues strongly reduce transposition rate; (iii) mutations with a strong effect on the free energy of folding in the DDE domain of the transposase significantly reduce transposition rate; and (iv) mutations that influence DNA and protein-protein interactions generally reduce transposition rate, although most hyperactive mutants are also located on the protein surface, including residues with protein-protein interactions. This suggests that hyperactivity results from the modification of protein interactions rather than the stabilization of the protein fold. Molecular Therapy (2016); doi:10.1038/mt.2016.110
Somatic transposition in the brain has the potential to influence the biosynthesis of metabolites involved in Parkinson’s disease and schizophrenia
It has recently been discovered that transposable elements show high activity in the mammalian brain; however, the magnitude of their influence on its functioning remains unclear. In this paper, I use flux balance analysis to examine the influence of somatic retrotransposition on brain metabolism and on the biosynthesis of its key metabolites, including neurotransmitters. The analysis shows that somatic transposition in the human brain can influence the biosynthesis of more than 250 metabolites, including dopamine, serotonin and glutamate, shows large inter-individual variability in metabolic effects, and may contribute to the development of Parkinson’s disease and schizophrenia. Reviewers: This article was reviewed by Dr Kenji Kojima (nominated by Dr Jerzy Jurka) and Dr Eugene Koonin.