515 research outputs found
Toward major evolutionary transitions theory 2.0
The impressive body of work on the major evolutionary transitions in the last 20 y calls for a reconstruction of the theory although a 2D account (evolution of informational systems and transitions in individuality) remains. Significant advances include the concept of fraternal and egalitarian transitions (lower-level units like and unlike, respectively). Multilevel selection, first without, then with, the collectives in focus is an important explanatory mechanism. Transitions are decomposed into phases of origin, maintenance, and transformation (i.e., further evolution) of the higher level units, which helps reduce the number of transitions in the revised list by two so that it is less top-heavy. After the transition, units show strong cooperation and very limited realized conflict. The origins of cells, the emergence of the genetic code and translation, the evolution of the eukaryotic cell, multicellularity, and the origin of human groups with language are reconsidered in some detail in the light of new data and considerations. Arguments are given why sex is not in the revised list as a separate transition. Some of the transitions can be recursive (e.g., plastids, multicellularity) or limited (transitions that share the usual features of major transitions without a massive phylogenetic impact, such as the micro- and macronuclei in ciliates). During transitions, new units of reproduction emerge, and establishment of such units requires high fidelity of reproduction (as opposed to mere replication)
Research: A comprehensive and quantitative exploration of thousands of viral genomes
The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends – such as gene density, noncoding percentage, and abundances of functional gene categories – across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends
Artificial and Natural Genetic Information Processing
Conventional methods of genetic engineering and more recent genome editing techniques focus on identifying genetic target sequences for manipulation. This is a result of historical concept of the gene which was also the main assumption of the ENCODE project designed to identify all functional elements in the human genome sequence.
However, the theoretical core concept changed dramatically. The old concept of genetic sequences which can be assembled and manipulated like molecular bricks has problems in explaining the natural genome-editing competences of viruses and RNA consortia that are able to insert or delete, combine and recombine genetic sequences
more precisely than random-like into cellular host organisms according to adaptational needs or even generate sequences de novo. Increasing knowledge about natural genome editing questions the traditional narrative of mutations (error replications) as essential for generating genetic diversity and genetic content arrangements in biological systems. This may have far-reaching consequences for our understanding
of artificial genome editing
The ancient Virus World and evolution of cells
BACKGROUND: Recent advances in genomics of viruses and cellular life forms have greatly stimulated interest in the origins and evolution of viruses and, for the first time, offer an opportunity for a data-driven exploration of the deepest roots of viruses. Here we briefly review the current views of virus evolution and propose a new, coherent scenario that appears to be best compatible with comparative-genomic data and is naturally linked to models of cellular evolution that, from independent considerations, seem to be the most parsimonious among the existing ones. RESULTS: Several genes coding for key proteins involved in viral replication and morphogenesis as well as the major capsid protein of icosahedral virions are shared by many groups of RNA and DNA viruses but are missing in cellular life forms. On the basis of this key observation and the data on extensive genetic exchange between diverse viruses, we propose the concept of the ancient virus world. The virus world is construed as a distinct contingent of viral genes that continuously retained its identity throughout the entire history of life. Under this concept, the principal lineages of viruses and related selfish agents emerged from the primordial pool of primitive genetic elements, the ancestors of both cellular and viral genes. Thus, notwithstanding the numerous gene exchanges and acquisitions attributed to later stages of evolution, most, if not all, modern viruses and other selfish agents are inferred to descend from elements that belonged to the primordial genetic pool. In this pool, RNA viruses would evolve first, followed by retroid elements, and DNA viruses. The Virus World concept is predicated on a model of early evolution whereby emergence of substantial genetic diversity antedates the advent of full-fledged cells, allowing for extensive gene mixing at this early stage of evolution. We outline a scenario of the origin of the main classes of viruses in conjunction with a specific model of precellular evolution under which the primordial gene pool dwelled in a network of inorganic compartments. Somewhat paradoxically, under this scenario, we surmise that selfish genetic elements ancestral to viruses evolved prior to typical cells, to become intracellular parasites once bacteria and archaea arrived at the scene. Selection against excessively aggressive parasites that would kill off the host ensembles of genetic elements would lead to early evolution of temperate virus-like agents and primitive defense mechanisms, possibly, based on the RNA interference principle. The emergence of the eukaryotic cell is construed as the second melting pot of virus evolution from which the major groups of eukaryotic viruses originated as a result of extensive recombination of genes from various bacteriophages, archaeal viruses, plasmids, and the evolving eukaryotic genomes. Again, this vision is predicated on a specific model of the emergence of eukaryotic cell under which archaeo-bacterial symbiosis was the starting point of eukaryogenesis, a scenario that appears to be best compatible with the data. CONCLUSION: The existence of several genes that are central to virus replication and structure, are shared by a broad variety of viruses but are missing from cellular genomes (virus hallmark genes) suggests the model of an ancient virus world, a flow of virus-specific genes that went uninterrupted from the precellular stage of life's evolution to this day. This concept is tightly linked to two key conjectures on evolution of cells: existence of a complex, precellular, compartmentalized but extensively mixing and recombining pool of genes, and origin of the eukaryotic cell by archaeo-bacterial fusion. The virus world concept and these models of major transitions in the evolution of cells provide complementary pieces of an emerging coherent picture of life's history. REVIEWERS: W. Ford Doolittle, J. Peter Gogarten, and Arcady Mushegian
Primal Eukaryogenesis:On the Communal Nature of Precellular States, Ancestral to Modern Life
This problem-oriented, exploratory and hypothesis-driven discourse toward the unknown combines several basic tenets: (i) a photo-active metal sulfide scenario of primal biogenesis in the porespace of shallow sedimentary flats, in contrast to hot deep-sea hydrothermal vent conditions; (ii) an inherently complex communal system at the common root of present life forms; (iii) a high degree of internal compartmentalization at this communal root, progressively resembling coenocytic (syncytial) super-cells; (iv) a direct connection from such communal super-cells to proto-eukaryotic macro-cell organization; and (v) multiple rounds of micro-cellular escape with streamlined reductive evolution—leading to the major prokaryotic cell lines, as well as to megaviruses and other viral lineages. Hopefully, such nontraditional concepts and approaches will contribute to coherent and plausible views about the origins and early life on Earth. In particular, the coevolutionary emergence from a communal system at the common root can most naturally explain the vast discrepancy in subcellular organization between modern eukaryotes on the one hand and both archaea and bacteria on the other
Varieties of living things: Life at the intersection of lineage and metabolism
publication-status: Publishedtypes: Articl
The combinatorics of overlapping genes
Overlapping genes exist in all domains of life and are much more abundant
than expected at their first discovery in the late 1970s. Assuming that the
reference gene is read in frame +0, an overlapping gene can be encoded in two
reading frames in the sense strand, denoted by +1 and +2, and in three reading
frames in the opposite strand, denoted by -0, -1 and -2. This motivated
numerous researchers to study the constraints induced by the genetic code on
the various overlapping frames, mostly based on information theory. Our focus
in this paper is on the constraints induced on two overlapping genes in terms
of amino acids, as well as polypeptides. We show that simple linear constraints
bind the amino acid composition of two proteins encoded by overlapping genes.
Novel constraints are revealed when polypeptides are considered, and not just
single amino acids. For example, in double-coding sequences with an overlapping
reading frame -2, each Tyrosine (denoted as Tyr or Y) in the overlapping frame
overlaps a Tyrosine in the reference frame +0 (and reciprocally), whereas
specific words (e.g. YY) never occur. We thus distinguish between null
constraints (YY = 0 in frame -2) and non-null constraints (Y in frame +0 Y
in frame -2). Our equivalence-based constraints are symmetrical and thus enable
the characterization of the joint composition of overlapping proteins. We
describe several formal frameworks and a graph algorithm to characterize and
compute these constraints. These results yield support for understanding the
mechanisms and evolution of overlapping genes, and for developing novel
overlapping gene detection methods
Origin, evolution and stability of overlapping genes in viruses: A systematic review
During their long evolutionary history viruses generated many proteins de novo by a mechanism called “overprinting”. Overprinting is a process in which critical nucleotide substitutions in a pre-existing gene can induce the expression of a novel protein by translation of an alternative open reading frame (ORF). Overlapping genes represent an intriguing example of adaptive conflict, because they simultaneously encode two proteins whose freedom to change is constrained by each other. However, overlapping genes are also a source of genetic novelties, as the constraints under which alternative ORFs evolve can give rise to proteins with unusual sequence properties, most importantly the potential for novel functions. Starting with the discovery of overlapping genes in phages infecting Escherichia coli, this review covers a range of studies dealing with detection of overlapping genes in small eukaryotic viruses (genomic length below 30 kb) and recognition of their critical role in the evolution of pathogenicity. Origin of overlapping genes, what factors favor their birth and retention, and how they manage their inherent adaptive conflict are extensively reviewed. Special attention is paid to the assembly of overlapping genes into ad hoc databases, suitable for future studies, and to the development of statistical methods for exploring viral genome sequences in search of undiscovered overlaps
A statistical analysis of the three-fold evolution of genomic compression through frame overlaps in prokaryotes
<p>Abstract</p> <p>Background</p> <p>Among microbial genomes, genetic information is frequently compressed, exploiting redundancies in the genetic code in order to store information in overlapping genes. We investigate the length, phase and orientation properties of overlap in 58 prokaryotic species evaluating neutral and selective mechanisms of evolution.</p> <p>Results</p> <p>Using a variety of statistical null models we find patterns of compressive coding that can not be explained purely in terms of the selective processes favoring genome minimization or translational coupling. The distribution of overlap lengths follows a fat-tailed distribution, in which a significant proportion of overlaps are in excess of 100 base pairs in length. The phase of overlap – pairing of codon positions in complementary reading frames – is strongly predicted by the translation orientation of each gene. We find that as overlapping genes become longer, they have a tendency to alternate among alternative overlap phases. Some phases seem to reflect codon pairings reducing the probability of non-synonymous substitution. We analyze the lineage-dependent features of overlapping genes by tracing a number of different continuous characters through the prokaryotic phylogeny using squared-change parsimony and observe both clade-specific and species-specific patterns.</p> <p>Conclusion</p> <p>Overlapping reading frames preserve in their structure, features relating to mutational origination of new genes, but have undergone modification for both immediate benefits and for variational buffering and amplification. Genomes come under a variety of different mutational and selectional pressures, and the structure of redundancies in overlapping genes can be used to detect these pressures. No single mechanism is able to account for all the variability observed among the set of prokaryotic overlapping genes but a three-fold analysis of evolutionary events provides a more integrative framework.</p> <p>Reviewers</p> <p>This article was reviewed by Eugene Koonin, Marten Huynem, and Han Liang.</p
ORFs, StORFs and Pseudogenes :Uncovering Novel Genomic Knowledge in Prokaryotic and Viral Genomes
- …
