Search CORE

34 research outputs found

Quantitative CMR population imaging on 20,000 subjects of the UK Biobank imaging study: LV/RV quantification pipeline and its evaluation

Author: Albà X
Attar R
Aung N
de Vila MH
Frangi AF
Fung K
Gooya A
Lee AM
Lukaschuk E
Neubauer S
Paiva JM
Pereañez M
Petersen SE
Piechnik SK
Sanghvi MM
Zhang L
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Population imaging studies generate data for developing and implementing personalised health strategies to prevent, or more effectively treat disease. Large prospective epidemiological studies acquire imaging for pre-symptomatic populations. These studies enable the early discovery of alterations due to impending disease, and enable early identification of individuals at risk. Such studies pose new challenges requiring automatic image analysis. To date, few large-scale population-level cardiac imaging studies have been conducted. One such study stands out for its sheer size, careful implementation, and availability of top quality expert annotation; the UK Biobank (UKB). The resulting massive imaging datasets (targeting ca. 100,000 subjects) has put published approaches for cardiac image quantification to the test. In this paper, we present and evaluate a cardiac magnetic resonance (CMR) image analysis pipeline that properly scales up and can provide a fully automatic analysis of the UKB CMR study. Without manual user interactions, our pipeline performs end-to-end image analytics from multi-view cine CMR images all the way to anatomical and functional bi-ventricular quantification. All this, while maintaining relevant quality controls of the CMR input images, and resulting image segmentations. To the best of our knowledge, this is the first published attempt to fully automate the extraction of global and regional reference ranges of all key functional cardiovascular indexes, from both left and right cardiac ventricles, for a population of 20,000 subjects imaged at 50 time frames per subject, for a total of one million CMR volumes. In addition, our pipeline provides 3D anatomical bi-ventricular models of the heart. These models enable the extraction of detailed information of the morphodynamics of the two ventricles for subsequent association to genetic, omics, lifestyle habits, exposure information, and other information provided in population imaging studies. We validated our proposed CMR analytics pipeline against manual expert readings on a reference cohort of 4620 subjects with contour delineations and corresponding clinical indexes. Our results show broad significant agreement between the manually obtained reference indexes, and those automatically computed via our framework. 80.67% of subjects were processed with mean contour distance of less than 1 pixel, and 17.50% with mean contour distance between 1 and 2 pixels. Finally, we compare our pipeline with a recently published approach reporting on UKB data, and based on deep learning. Our comparison shows similar performance in terms of segmentation accuracy with respect to human experts

Lirias

Southampton (e-Prints Soton)

Oxford University Research Archive

Enlighten

The University of Manchester - Institutional Repository

Queen Mary Research Online

White Rose Research Online

Structure and Age Jointly Influence Rates of Protein Evolution

Author: A Vishnoi
AE Hirsh
AE Lobkovsky
B Krishnamoorthy
C Deutsch
C Notredame
C Pál
C Pál
C Toft
C-H Kuo
CD Bustamante
CW Carter
DA Drummond
DA Drummond
DA Drummond
David Bostick
DC Ramsey
DJ Lipman
DP Wall
E Ferrada
E Franzosa
EA Franzosa
G Marais
G Wang
GC Conant
HB Fraser
HH Gan
HM Berman
I-G Choi
JB Plotkin
JD Bloom
JD Bloom
JD Bloom
JJ Cai
JJ Cai
JL England
Joshua B. Plotkin
K Katoh
LA Mirny
M Masso
M Simon
M Toll-Riera
M Toll-Riera
M. Mar Albà
Macarena Toll-Riera
MM Albà
MW Hahn
N Goldman
N Wingreen
P Flicek
P Green
P Wong
R Samudrala
Ruben E. Valas
S Miller
SF Altschul
T Domazet-Loso
T Zhou
V Daubin
W Kabsch
Y Feng
YI Wolf
Z Yang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

What factors determine a protein's rate of evolution are actively debated. Especially unclear is the relative role of intrinsic factors of present-day proteins versus historical factors such as protein age. Here we study the interplay of structural properties and evolutionary age, as determinants of protein evolutionary rate. We use a large set of one-to-one orthologs between human and mouse proteins, with mapped PDB structures. We report that previously observed structural correlations also hold within each age group – including relationships between solvent accessibility, designabililty, and evolutionary rates. However, age also plays a crucial role: age modulates the relationship between solvent accessibility and rate. Additionally, younger proteins, despite being less designable, tend to evolve faster than older proteins. We show that previously reported relationships between age and rate cannot be explained by structural biases among age groups. Finally, we introduce a knowledge-based potential function to study the stability of proteins through large-scale computation. We find that older proteins are more stable for their native structure, and more robust to mutations, than younger ones. Our results underscore that several determinants, both intrinsic and historical, can interact to determine rates of protein evolution

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Phylostratigraphic tracking of cancer genes suggests a link to the emergence of multicellularity in metazoa

Author: A Rokas
A Traulsen
AA Schäffer
AM Leroi
B Schierwater
CW Dunn
D Rambaldi
DF Squires
DG Knowles
Diethard Tautz
EA Sperling
F Delsuc
F Delsuc
F Michor
G Caetano-Anollés
GS Chang
H Philippe
H Rajagopalan
IG Choi
IP Touw
K Khalturin
KW Kinzler
L Altenberg
L Rivals Personnaz
LM Merlo
M Toll-Riera
MA Nowak
MD Herron
MJ Telford
MM Albà
MR Stratton
N King
OM Sieber
PA Futreal
PB Rainey
PF Jonsson
RA Studer
RA Weinberg
RK Grosberg
RU Jänicke
S Bamford
S Jones
S Okasha
T Domazet-Lošo
T Domazet-Lošo
T Domazet-Lošo
Tomislav Domazet-Lošo
WM Abdel-Rahman
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background: Phylostratigraphy is a method used to correlate the evolutionary origin of founder genes (that is, functional founder protein domains) of gene families with particular macroevolutionary transitions. It is based on a model of genome evolution that suggests that the origin of complex phenotypic innovations will be accompanied by the emergence of such founder genes, the descendants of which can still be traced in extant organisms. The origin of multicellularity can be considered to be a macroevolutionary transition, for which new gene functions would have been required. Cancer should be tightly connected to multicellular life since it can be viewed as a malfunction of interaction between cells in a multicellular organism. A phylostratigraphic tracking of the origin of cancer genes should, therefore, also provide insights into the origin of multicellularity. Results: We find two strong peaks of the emergence of cancer related protein domains, one at the time of the origin of the first cell and the other around the time of the evolution of the multicellular metazoan organisms. These peaks correlate with two major classes of cancer genes, the 'caretakers', which are involved in general functions that support genome stability and the 'gatekeepers', which are involved in cellular signalling and growth processes. Interestingly, this phylogenetic succession mirrors the ontogenetic succession of tumour progression, where mutations in caretakers are thought to precede mutations in gatekeepers. Conclusions: A link between multicellularity and formation of cancer has often been predicted. However, this has not so far been explicitly tested. Although we find that a significant number of protein domains involved in cancer predate the origin of multicellularity, the second peak of cancer protein domain emergence is, indeed, connected to a phylogenetic level where multicellular animals have emerged. The fact that we can find a strong and consistent signal for this second peak in the phylostratigraphic map implies that a complex multi-level selection process has driven the transition to multicellularity

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Profiles of low complexity regions in Apicomplexa

Author: A Coletta
AK Dunker
AL Hughes
Ananias A. Escalante
AR Dalby
B Efron
CJ Brown
CJ Brown
David Fisher
DG King
E Pizzi
E Salichs
F Verra
F Yu
Fabia U. Battistuzzi
GB Golding
H Ellegren
H Luo
HY Xue
I Kirmitzoglou
JC Wootton
JC Wootton
JM Hancock
JR Gatchel
JW Fondon
K Tamura
Kristan A. Schneider
L Li
L Mularoni
L Mularoni
M Frugier
M Simon
M Toll-Riera
MA DePristo
MA Huntley
MA Huntley
Matthew K. Spencer
MG Kendall
MM Albà
MM Zilversmit
MO Press
MU Ferreira
NG Faux
NG Faux
P Romero
P Tompa
PE Midford
R Gemayel
R Gemayel
RS Nussenzweig
SAS Institute Inc
SK Behura
Sophia Chaudhry
W Haerty
W Haerty
Y Kashi
Y Nakachi
Y-C Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Survey of glycine-rich proteins (GRPs) in the Eucalyptus expressed sequence tag database (ForEST)

Author: Albà MM
Albà MM
Allagulova ChR
Allona I
Altschul SF
Amanda Mangeon
Bagni C
Cassab GI
Claudia Magioli
Condit CM
Condit CM
de Oliveira DE
Ferreira MA
Franco LO
Freire MA
Fusaro A
Gilberto Sachetto-Martins
Hanano S
Hirose T
Karlson D
Karlson D
Keller B
Keller B
Krecic AM
Kumar S
Lambermon MH
Le Provost G
Lorkovic ZJ
Magioli C
Matsushima N
Murphy DJ
Murphy DJ
Ni Z
Obokata J
Park AR
Ricardo Magrani Junqueira
Ringli C
Rogério Margis
Ryser U
Ryser U
Sachetto-Martins G
Sachetto-Martins G
Sandal NN
Showalter AM
Silvia Nora Bocca
Sitnikova T
Thompson JD
Vanessa Cardeal
Verica JA
Welin BV
Zhang YI
Publication venue: 'FapUNIFESP (SciELO)'
Publication date: 01/01/2005
Field of study

Crossref

Evidence for the additions of clustered interacting nodes during the evolution of protein interaction networks from network motifs

Abstract Background High-throughput screens have revealed large-scale protein interaction networks defining most cellular functions. How the proteins were added to the protein interaction network during its growth is a basic and important issue. Network motifs represent the simplest building blocks of cellular machines and are of biological significance. Results Here we study the evolution of protein interaction networks from the perspective of network motifs. We find that in current protein interaction networks, proteins of the same age class tend to form motifs and such co-origins of motif constituents are affected by their topologies and biological functions. Further, we find that the proteins within motifs whose constituents are of the same age class tend to be densely interconnected, co-evolve and share the same biological functions, and these motifs tend to be within protein complexes. Conclusions Our findings provide novel evidence for the hypothesis of the additions of clustered interacting nodes and point out network motifs, especially the motifs with the dense topology and specific function may play important roles during this process. Our results suggest functional constraints may be the underlying driving force for such additions of clustered interacting nodes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Insight into Role of Selection in the Evolution of Polyglutamine Tracts in Humans

Author: A Albrecht
A Keitti
AW Dunah
CJ Brown
F Cruz
GB Golding
H Green
Hongwei Li
HT Orr
Jing Liu
JM Fondon
JM Hancock
Keliang Wu
L Mularoni
L Mularoni
M Simon
MF Perutz
MH Schaefer
MM Albà
MM Albà
MM Albà
MW Kim
NG Faux
PE Wright
R Matteo
Reiner Albert Veitia
RN Freiman
S Karlin
Yang Ziheng
Yang Ziheng
Yang Ziheng
Yang Ziheng
Yuan Chen
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref

Evolutionary conservation and selection of human disease gene orthologs in the rat and mouse genomes.

Author: Albà MM
Cooper DN
Fechtel K
Goodstadt L
Huang H
Ponting CP
Smith D
Stenson PD
Wang H
Weinstock KG
Winter EE
Xing H
Publication venue
Publication date: 01/01/2004
Field of study

BACKGROUND: Model organisms have contributed substantially to our understanding of the etiology of human disease as well as having assisted with the development of new treatment modalities. The availability of the human, mouse and, most recently, the rat genome sequences now permit the comprehensive investigation of the rodent orthologs of genes associated with human disease. Here, we investigate whether human disease genes differ significantly from their rodent orthologs with respect to their overall levels of conservation and their rates of evolutionary change. RESULTS: Human disease genes are unevenly distributed among human chromosomes and are highly represented (99.5%) among human-rodent ortholog sets. Differences are revealed in evolutionary conservation and selection between different categories of human disease genes. Although selection appears not to have greatly discriminated between disease and non-disease genes, synonymous substitution rates are significantly higher for disease genes. In neurological and malformation syndrome disease systems, associated genes have evolved slowly whereas genes of the immune, hematological and pulmonary disease systems have changed more rapidly. Amino-acid substitutions associated with human inherited disease occur at sites that are more highly conserved than the average; nevertheless, 15 substituting amino acids associated with human disease were identified as wild-type amino acids in the rat. Rodent orthologs of human trinucleotide repeat-expansion disease genes were found to contain substantially fewer of such repeats. Six human genes that share the same characteristics as triplet repeat-expansion disease-associated genes were identified; although four of these genes are expressed in the brain, none is currently known to be associated with disease. CONCLUSIONS: Most human disease genes have been retained in rodent genomes. Synonymous nucleotide substitutions occur at a higher rate in disease genes, a finding that may reflect increased mutation rates in the chromosomal regions in which disease genes are found. Rodent orthologs associated with neurological function exhibit the greatest evolutionary conservation; this suggests that rodent models of human neurological disease are likely to most faithfully represent human disease processes. However, with regard to neurological triplet repeat expansion-associated human disease genes, the contraction, relative to human, of rodent trinucleotide repeats suggests that rodent loci may not achieve a 'critical repeat threshold' necessary to undergo spontaneous pathological repeat expansions. The identification of six genes in this study that have multiple characteristics associated with repeat expansion-disease genes raises the possibility that not all human loci capable of facilitating neurological disease by repeat expansion have as yet been identified

Oxford University Research Archive

Systematic Analysis of Compositional Order of Proteins Reveals New Characteristics of Biological Functions and a Universal Correlate of Macroevolution

Crossref

Uncovering de novo gene birth in yeast using deep transcriptomics

Author: A McLysaght
A-R Carvunis
AE Almada
AL Delcher
AM Tsankov
AP Gasch
B Dujon
B Wu
BA Wilson
C Schlötterer
C Trapnell
CW Carter
D Li
D Li
D Osorio
D Tautz
DG Knowles
DJ Begun
E Durand
E Elhaik
EL van Dijk
F Huber
F Madeo
F Sievers
G Oshiro
H Neil
HT Baalsrud
J Cai
J Camblong
J Chen
J Luis Villanueva-Cañas
J Ruiz-Orera
J Ruiz-Orera
J Ruiz-Orera
J Ruiz-Orera
JA Reinhardt
JF Schmitz
K Amako
L Zhang
L Zhao
M Kato
M Morgenstern
M Toll-Riera
ME Dinger
MG Grabherr
MM Albà
MM Albà
MM Pena
MS Werner
MT Levine
N Palmieri
N Vakirlis
N Vakirlis
N Vakirlis
NT Ingolia
NT Ingolia
P Majic
P Uetz
R Smith-Unna
S Fogel
SA Slavoff
SF Altschul
SF Nothwehr
SN Rodin
T Domazet-Lošo
T-C Lu
TD Wu
TJ Treangen
V Pelechano
V Sophianopoulou
W Huber
WR Blevins
WR Blevins
X Zhuang
Z Ji
Z Xu
ZW Arendsee
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

De novo gene origination has been recently established as an important mechanism for the formation of new genes. In organisms with a large genome, intergenic and intronic regions provide plenty of raw material for new transcriptional events to occur, but little is know about how de novo transcripts originate in more densely-packed genomes. Here, we identify 213 de novo originated transcripts in Saccharomyces cerevisiae using deep transcriptomics and genomic synteny information from multiple yeast species grown in two different conditions. We find that about half of the de novo transcripts are expressed from regions which already harbor other genes in the opposite orientation; these transcripts show similar expression changes in response to stress as their overlapping counterparts, and some appear to translate small proteins. Thus, a large fraction of de novo genes in yeast are likely to co-evolve with already existing genes.The work was funded by grants PGC2018-094091-B-I00, BFU2015-65235-P, BFU2015-68351-P, BFU2016-80039-R, TIN2015-69175-C4-3-R and RTI2018-094403-B-C33 from Spanish Government—FEDER (EU), and from grant PT17/0009/0014 from Instituto de Salud Carlos III—FEDER. We also received funding from the “Maria de Maeztu” Programme for Units of Excellence in R&D (MDM-2014-0370) and from Agència de Gestió d’Ajuts Universitaris i de Recerca Generalitat de Catalunya (AGAUR), grants number 2014SGR1121, 2014SGR0974, 2017SGR1054 and 2017SGR01020 and, predoctoral fellowship (FI) to W.R.B

Crossref

UPCommons. Portal del coneixement obert de la UPC

UPF Digital Repository