Search CORE

340 research outputs found

The organization of biological sequences into constrained and unconstrained parts determines fundamental properties of genotype-phenotype maps.

Author: Ahnert SE
Greenbury SF
Publication venue: J R Soc Interface
Publication date: 25/11/2015
Field of study

Biological information is stored in DNA, RNA and protein sequences, which can be understood as genotypes that are translated into phenotypes. The properties of genotype-phenotype (GP) maps have been studied in great detail for RNA secondary structure. These include a highly biased distribution of genotypes per phenotype, negative correlation of genotypic robustness and evolvability, positive correlation of phenotypic robustness and evolvability, shape-space covering, and a roughly logarithmic scaling of phenotypic robustness with phenotypic frequency. More recently similar properties have been discovered in other GP maps, suggesting that they may be fundamental to biological GP maps, in general, rather than specific to the RNA secondary structure map. Here we propose that the above properties arise from the fundamental organization of biological information into 'constrained' and 'unconstrained' sequences, in the broadest possible sense. As 'constrained' we describe sequences that affect the phenotype more immediately, and are therefore more sensitive to mutations, such as, e.g. protein-coding DNA or the stems in RNA secondary structure. 'Unconstrained' sequences, on the other hand, can mutate more freely without affecting the phenotype, such as, e.g. intronic or intergenic DNA or the loops in RNA secondary structure. To test our hypothesis we consider a highly simplified GP map that has genotypes with 'coding' and 'non-coding' parts. We term this the Fibonacci GP map, as it is equivalent to the Fibonacci code in information theory. Despite its simplicity the Fibonacci GP map exhibits all the above properties of much more complex and biologically realistic GP maps. These properties are therefore likely to be fundamental to many biological GP maps.SEA was supported by The Royal Society. SFG was supported by the EPSRC.This is the final version of the article. It was first available from Royal Society Publishing via http://dx.doi.org/10.1098/rsif.2015.072

Crossref

PubMed Central

Spiral - Imperial College Digital Repository

Apollo (Cambridge)

A tractable genotype-phenotype map for the self-assembly of protein quaternary structure

Author: Ahnert Sebastian E.
Greenbury Sam F.
Johnston Iain G.
Louis Ard A.
Publication venue
Publication date: 02/11/2013
Field of study

The mapping between biological genotypes and phenotypes is central to the study of biological evolution. Here we introduce a rich, intuitive, and biologically realistic genotype-phenotype (GP) map, that serves as a model of self-assembling biological structures, such as protein complexes, and remains computationally and analytically tractable. Our GP map arises naturally from the self-assembly of polyomino structures on a 2D lattice and exhibits a number of properties:

\textit{redundancy}

(genotypes vastly outnumber phenotypes),

\textit{phenotype bias}

(genotypic redundancy varies greatly between phenotypes),

\textit{genotype component disconnectivity}

(phenotypes consist of disconnected mutational networks) and

\textit{shape space covering}

(most phenotypes can be reached in a small number of mutations). We also show that the mutational robustness of phenotypes scales very roughly logarithmically with phenotype redundancy and is positively correlated with phenotypic evolvability. Although our GP map describes the assembly of disconnected objects, it shares many properties with other popular GP maps for connected units, such as models for RNA secondary structure or the HP lattice model for protein tertiary structure. The remarkable fact that these important properties similarly emerge from such different models suggests the possibility that universal features underlie a much wider class of biologically realistic GP maps.Comment: 12 pages, 6 figure

arXiv.org e-Print Archive

Crossref

University of Birmingham Research Portal

PubMed Central

HyperTraPS: Inferring probabilistic patterns of trait acquisition in evolutionary and disease progression pathways

Author: Barahona M
Greenbury S
Johnston I
Publication venue: 'Elsevier BV'
Publication date: 26/10/2019
Field of study

The explosion of data throughout the biomedical sciences provides unprecedented opportunities to learn about the dynamics of evolution and disease progression, but harnessing these large and diverse datasets remains challenging. Here, we describe a highly generalisable statistical platform to infer the dynamic pathways by which many, potentially interacting, discrete traits are acquired or lost over time in biomedical systems. The platform uses HyperTraPS (hypercubic transition path sampling) to learn progression pathways from cross-sectional, longitudinal, or phylogenetically-linked data with unprecedented efficiency, readily distinguishing multiple competing pathways, and identifying the most parsimonious mechanisms underlying given observations. Its Bayesian structure quantifies uncertainty in pathway structure and allows interpretable predictions of behaviours, such as which symptom a patient will acquire next. We exploit the model’s topology to provide visualisation tools for intuitive assessment of multiple, variable pathways. We apply the method to ovarian cancer progression and the evolution of multidrug resistance in tuberculosis, demonstrating its power to reveal previously undetected dynamic pathways

arXiv.org e-Print Archive

University of Bergen

Spiral - Imperial College Digital Repository

NORA - Norwegian Open Research Archives

Genetic correlations greatly increase mutational robustness and can both reduce and enhance evolvability

Author: Ahnert SE
Greenbury SF
Louis AA
Schaper S
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 24/01/2016
Field of study

Mutational neighbourhoods in genotype-phenotype (GP) maps are widely believed to be more likely to share characteristics than expected from random chance. Such genetic correlations should strongly influence evolutionary dynamics. We explore and quantify these intuitions by comparing three GP maps—a model for RNA secondary structure, the HP model for protein tertiary structure, and the Polyomino model for protein quaternary structure—to a simple random null model that maintains the number of genotypes mapping to each phenotype, but assigns genotypes randomly. The mutational neighbourhood of a genotype in these GP maps is much more likely to contain genotypes mapping to the same phenotype than in the random null model. Such neutral correlations can be quantified by the robustness to mutations, which can be many orders of magnitude larger than that of the null model, and crucially, above the critical threshold for the formation of large neutral networks of mutationally connected genotypes which enhance the capacity for the exploration of phenotypic novelty. Thus neutral correlations increase evolvability. We also study non-neutral correlations: Compared to the null model, i) If a particular (non-neutral) phenotype is found once in the 1-mutation neighbourhood of a genotype, then the chance of finding that phenotype multiple times in this neighbourhood is larger than expected; ii) If two genotypes are connected by a single neutral mutation, then their respective non-neutral 1-mutation neighbourhoods are more likely to be similar; iii) If a genotype maps to a folding or self-assembling phenotype, then its non-neutral neighbours are less likely to be a potentially deleterious non-folding or non-assembling phenotype. Non-neutral correlations of type i) and ii) reduce the rate at which new phenotypes can be found by neutral exploration, and so may diminish evolvability, while non-neutral correlations of type iii) may instead facilitate evolutionary exploration and so increase evolvability

Spiral - Imperial College Digital Repository

Trustchain -- Trustworthy Decentralised Public Key Infrastructure for Digital Credentials

Author: France Lydia
Greenbury Sam
Hare Luke
Hobson Tim
Wochner Pamela
Publication venue
Publication date: 05/01/2024
Field of study

The sharing of public key information is central to the digital credential security model, but the existing Web PKI with its opaque Certification Authorities and synthetic attestations serves a very different purpose. We propose a new approach to decentralised public key infrastructure, designed for digital identity, in which connections between legal entities that are represented digitally correspond to genuine, pre-existing relationships between recognisable institutions. In this scenario, users can judge for themselves the level of trust they are willing to place in a given chain of attestations. Our proposal includes a novel mechanism for establishing a root of trust in a decentralised setting via independently-verifiable timestamping. We also present a reference implementation built on open networks, protocols and standards. The system has minimal setup costs and is freely available for any community to adopt as a digital public good.Comment: 10 pages, 4 figures, presented at the International Conference on AI and the Digital Economy (CADE 2023), Venice, Italy. Replaces the preprint version, with minor changes & additions based on reviewers' comment

arXiv.org e-Print Archive

Genetic Correlations Greatly Increase Mutational Robustness and Can Both Reduce and Enhance Evolvability.

Author: Ahnert Sebastian E
Greenbury Sam F
Louis Ard A
Schaper Steffen
Publication venue: PLoS Comput Biol
Publication date: 01/01/2015
Field of study

Mutational neighbourhoods in genotype-phenotype (GP) maps are widely believed to be more likely to share characteristics than expected from random chance. Such genetic correlations should strongly influence evolutionary dynamics. We explore and quantify these intuitions by comparing three GP maps-a model for RNA secondary structure, the HP model for protein tertiary structure, and the Polyomino model for protein quaternary structure-to a simple random null model that maintains the number of genotypes mapping to each phenotype, but assigns genotypes randomly. The mutational neighbourhood of a genotype in these GP maps is much more likely to contain genotypes mapping to the same phenotype than in the random null model. Such neutral correlations can be quantified by the robustness to mutations, which can be many orders of magnitude larger than that of the null model, and crucially, above the critical threshold for the formation of large neutral networks of mutationally connected genotypes which enhance the capacity for the exploration of phenotypic novelty. Thus neutral correlations increase evolvability. We also study non-neutral correlations: Compared to the null model, i) If a particular (non-neutral) phenotype is found once in the 1-mutation neighbourhood of a genotype, then the chance of finding that phenotype multiple times in this neighbourhood is larger than expected; ii) If two genotypes are connected by a single neutral mutation, then their respective non-neutral 1-mutation neighbourhoods are more likely to be similar; iii) If a genotype maps to a folding or self-assembling phenotype, then its non-neutral neighbours are less likely to be a potentially deleterious non-folding or non-assembling phenotype. Non-neutral correlations of type i) and ii) reduce the rate at which new phenotypes can be found by neutral exploration, and so may diminish evolvability, while non-neutral correlations of type iii) may instead facilitate evolutionary exploration and so increase evolvability.This work was funded under EP/P504287/1 by the Engineering and Physical Sciences Research Council (https://www.epsrc.ac.uk). SEA is supported by The Royal Society (https://royalsociety.org/).This is the final version of the article. It first appeared from PLOS via http://dx.doi.org/10.1371/journal.pcbi.100477

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Apollo (Cambridge)

FigShare

Maximum mutational robustness in genotype-phenotype maps follows a self-similar blancmange-like curve

Author: Ahnert Sebastian E
Dingle Kamaludin
Greenbury Sam F
Louis Ard A
Mohanty Vaibhav
Narayanan Shyam
Sarkany Tasmin
Publication venue: Royal Society
Publication date: 26/07/2023
Field of study

Phenotype robustness, defined as the average mutational robustness of all the genotypes that map to a given phenotype, plays a key role in facilitating neutral exploration of novel phenotypic variation by an evolving population. By applying results from coding theory, we prove that the maximum phenotype robustness occurs when genotypes are organized as bricklayer's graphs, so-called because they resemble the way in which a bricklayer would fill in a Hamming graph. The value of the maximal robustness is given by a fractal continuous everywhere but differentiable nowhere sums-of-digits function from number theory. Interestingly, genotype-phenotype maps for RNA secondary structure and the hydrophobic-polar (HP) model for protein folding can exhibit phenotype robustness that exactly attains this upper bound. By exploiting properties of the sums-of-digits function, we prove a lower bound on the deviation of the maximum robustness of phenotypes with multiple neutral components from the bricklayer's graph bound, and show that RNA secondary structure phenotypes obey this bound. Finally, we show how robustness changes when phenotypes are coarse-grained and derive a formula and associated bounds for the transition probabilities between such phenotypes

Oxford University Research Archive

Reply to Ocklenburg and Mundorf: the interplay of developmental bias and natural selection

Author: Ahnert Sebastian E
Camargo Chico Q
Dingle Kamaludin
Doye Jonathan
Greenbury Sam F
Johnston Iain G
Louis Adriaan
Publication venue: National Academy of Sciences
Publication date: 05/07/2022
Field of study

Oxford University Research Archive

Learning machines for health and beyond

Author: Abroshan Mahed
Giles Oscar
Greenbury Sam
Roberts Jack
Steyn Jannetta S
van der Schaar Mihaela
Wilson Alan
Yong May
Publication venue
Publication date: 10/04/2023
Field of study

Machine learning techniques are effective for building predictive models because they are good at identifying patterns in large datasets. Development of a model for complex real life problems often stops at the point of publication, proof of concept or when made accessible through some mode of deployment. However, a model in the medical domain risks becoming obsolete as soon as patient demographic changes. The maintenance and monitoring of predictive models post-publication is crucial to guarantee their safe and effective long term use. As machine learning techniques are effectively trained to look for patterns in available datasets, the performance of a model for complex real life problems will not peak and remain fixed at the point of publication or even point of deployment. Rather, data changes over time, and they also changed when models are transported to new places to be used by new demography.Comment: 12 pages, 3 figure

arXiv.org e-Print Archive