Search CORE

eScholarship - University of California

Somatic Mutations Reveal Lineage Relationships and Age-Related Mutagenesis in Human Hematopoiesis

Author: Abkowitz
Alexandrov
Alexandrov
Alexandrov
Behjati
Blokzijl
Blokzijl
Bowie
Carrelha
Catlin
DePristo
Foudi
Genovese
Haas
Hiatt
Inman
Jager
Jaiswal
Ju
Karran
Laurenti
Lee-Six
Li
Li
Lodato
Mardis
Notta
Notta
Oguro
Passegué
Pei
Pleasance
Rodriguez-Fraticelli
Rossi
Stratton
Welch
Wilson
Xie
Yamamoto
Yu
Zink
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

Mutation accumulation during life can contribute to hematopoietic dysfunction; however, the underlying dynamics are unknown. Somatic mutations in blood progenitors can provide insight into the rate and processes underlying this accumulation, as well as the developmental lineage tree and stem cell division numbers. Here,we catalog mutations in the genomes of human-bone-marrow-derived and umbilical-cordblood- derived hematopoietic stem and progenitor cells (HSPCs). We find that mutations accumulate gradually during life with approximately 14 base substitutions per year. The majority of mutations were acquired after birth and could be explained by the constant activity of various endogenous mutagenic processes, which also explains the mutation load in acute myeloid leukemia (AML). Using these mutations, we construct a developmental lineage tree of human hematopoiesis, revealing a polyclonal architecture and providing evidence that developmental clones exhibit multipotency. Our approach highlights features of human native hematopoiesis and its implications for leukemogenesis.The authors would like to thank the Hartwig Medical Foundation (Amsterdam, the Netherlands) for facilitating low-input whole-genome sequencing, P.J. Coffer for providing umbilical cord blood samples, and P.J. Campbell and D.C. Wedge for sharing scripts. This study was financially supported by an EMBO long-term fellowship to F.G.O. (ALTF 655-2016), an ERC starting grant (ERC2014-STG637904) to I.V., a VIDI grant of the Netherlands Organisation for Scientific Research (NWO) (no. 016.Vidi.171.023) to R.v.B., funding from Worldwide Cancer Research (WCR) (no. 16-0193) to R.v.B., and NIH grants HL128850-01A1 and P01HL13147 to F.D.C. F.D.C. is a scholar of the Howard Hughes Medical Institute and the Leukemia and Lymphoma Society

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UCrea

Digital.CSIC

Somatic Mutations Reveal Lineage Relationships and Age-Related Mutagenesis in Human Hematopoiesis

Author: Abkowitz
Alexandrov
Alexandrov
Alexandrov
Behjati
Blokzijl
Blokzijl
Bowie
Carrelha
Catlin
DePristo
Foudi
Genovese
Haas
Hiatt
Inman
Jager
Jaiswal
Ju
Karran
Laurenti
Lee-Six
Li
Li
Lodato
Mardis
Notta
Notta
Oguro
Passegué
Pei
Pleasance
Rodriguez-Fraticelli
Rossi
Stratton
Welch
Wilson
Xie
Yamamoto
Yu
Zink
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

South East Academic Libraries System (SEALS)

UCrea

Digital.CSIC

Enlighten

Rhodes Repository (SEALS)

University of St. Andrews - Pure

St Andrews Research Repository

Potential for early warning of viral influenza activity in the community by monitoring clinical diagnoses of influenza in hospital emergency departments

Author: B Miller
C Bowie
C Sonesson
C Viboud
David J Muscatello
DG Wolf
DJ Muscatello
E Vergu
FT Bourgeois
GEP Box
GM Ljung
HA Johnson
J Stein
JS Brownstein
JU Espino
K Moore
MA Marx
ML Bell
MS O'Neill
N Marsden-Haug
PJ Diggle
PV Effler
R Gilmour
R Heffernan
R Peng
R Serfling
RH Shumway
Robert Aitken
SD Collins
Tim Churches
WB Lober
Wei Zheng
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Although syndromic surveillance systems are gaining acceptance as useful tools in public health, doubts remain about whether the anticipated early warning benefits exist. Many assessments of this question do not adequately account for the confounding effects of autocorrelation and trend when comparing surveillance time series and few compare the syndromic data stream against a continuous laboratory-based standard. We used time series methods to assess whether monitoring of daily counts of Emergency Department (ED) visits assigned a clinical diagnosis of influenza could offer earlier warning of increased incidence of viral influenza in the population compared with surveillance of daily counts of positive influenza test results from laboratories. Methods For the five-year period 2001 to 2005, time series were assembled of ED visits assigned a provisional ED diagnosis of influenza and of laboratory-confirmed influenza cases in New South Wales (NSW), Australia. Poisson regression models were fitted to both time series to minimise the confounding effects of trend and autocorrelation and to control for other calendar influences. To assess the relative timeliness of the two series, cross-correlation analysis was performed on the model residuals. Modelling and cross-correlation analysis were repeated for each individual year. Results Using the full five-year time series, short-term changes in the ED time series were estimated to precede changes in the laboratory series by three days. For individual years, the estimate was between three and 18 days. The time advantage estimated for the individual years 2003–2005 was consistently between three and four days. Conclusion Monitoring time series of ED visits clinically diagnosed with influenza could potentially provide three days early warning compared with surveillance of laboratory-confirmed influenza. When current laboratory processing and reporting delays are taken into account this time advantage is even greater.</p

Springer - Publisher Connector

arXiv.org e-Print Archive

Nature of protein family signatures: Insights from singular value analysis of position-specific scoring matrices

Author: A Bundi
A Kidera
AG Murzin
Akira R. Kinjo
AR Kinjo
AR Kinjo
AR Kinjo
AR Kinjo
AR Kinjo
AR Knjo
B Qian
B Rost
BE Suzek
C Barber
C Rosano
D Bashford
David Jones
DT Jones
DT Jones
F Beghin
FM Richards
G Wang
Haruki Nakamura
HM Berman
J Kyte
JL Fauchère
JO Wrabl
JT Lecomte
JU Bowie
JU Bowie
K Nakai
K Nishikawa
K Nishikawa
K Tomii
M Charton
M Gribskov
M Kann
M Levitt
M Oobatake
M Ota
M Ota
M Porto
MG Rudolph
MO Dayhoff
P Klein
P Koehl
P Pokarowski
PHA Sneath
R Aurora
R Durbin
R Grantham
RA Horn
RD Finn
RF Doolittle
RM Sweet
S Fukuchi
S Henikoff
S Kawashima
S Miyazawa
SF Altschul
SF Altschul
SR Eddy
T Ishida
TM Cover
U Bastolla
WE Royer Jr
WR Taylor
Z Yuan
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 07/11/2007
Field of study

Position-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singular value decomposition to a set of PSSMs and examine the properties of dominant right and left singular vectors. The first right singular vectors were correlated with various amino acid indices including relative mutability, amino acid composition in protein interior, hydropathy, or turn propensity, depending on proteins. A significant correlation between the first left singular vector and a measure of site conservation was observed. It is shown that the contribution of the first singular component to the PSSMs act to disfavor potentially but falsely functionally important residues at conserved sites. The second right singular vectors were highly correlated with hydrophobicity scales, and the corresponding left singular vectors with contact numbers of protein structures. It is suggested that sequence alignment with a PSSM is essentially equivalent to threading supplemented with functional information. The presented method may be used to separate functionally important sites from structurally important ones, and thus it may be a useful tool for predicting protein functions.Comment: 22 pages, 7 figures, 4 table

CiteSeerX

Public Library of Science (PLOS)

Public Library of Science (PLOS)

Molecular Basis of NDM-1, a New Antibiotic Resistance Determinant

Author: C Bebrone
C Moali
Cai Guang-Yang
Cheng Luo
D Yong
H Park
H Zhang
Hong Liu
Hualiang Jiang
I Garcia-Saez
JM Rolain
JU Bowie
K Yu
Kaixian Chen
Keqin Kathy Li
KK Kumarasamy
Lefu Lan
Lianchun Li
Limin Chen
Lin Mei
Mingyue Zheng
P Lassaux
P Lassaux
RA Friesner
RA Laskowski
S Bounaga
SF Altschul
Xiangqian Kong
Xu Shen
Y Guo
Y Yamaguchi
Yao Hong
Yuanyuan Wang
Z Wang
Z Wang
Zhongjie Liang
Publication venue: Public Library of Science
Publication date: 24/08/2011
Field of study

The New Delhi Metallo-β-lactamase (NDM-1) was first reported in 2009 in a Swedish patient. A recent study reported that Klebsiella pneumonia NDM-1 positive strain or Escherichia coli NDM-1 positive strain was highly resistant to all antibiotics tested except tigecycline and colistin. These can no longer be relied on to treat infections and therefore, NDM-1 now becomes potentially a major global health threat

Structure-based statistical analysis of transmembrane helices

Author: A Holt
A Saurí
A Senes
ACV Johansson
Carlos Baeza-Delgado
DE Engel
E Wallin
FS Cordes
G Heijne von
HJ Sharpe
HM Berman
I Nilsson
I Nilsson
Ismael Mingarro
IT Arkin
J Nilsson
JL MacCallum
JU Bowie
K Illergård
L Martínez-Gil
L Pal
M Blaber
M Eilers
M Lerch-Bader
M Orzáez
MA Lomize
Marc A. Marti-Renom
MB Ulmschneider
MB Ulmschneider
RW Williams
S Jayasinghe
S Jayasinghe
SC Li
SH White
SH White
SH White
T Hessa
T Hessa
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Recent advances in determination of the high-resolution structure of membrane proteins now enable analysis of the main features of amino acids in transmembrane (TM) segments in comparison with amino acids in water-soluble helices. In this work, we conducted a large-scale analysis of the prevalent locations of amino acids by using a data set of 170 structures of integral membrane proteins obtained from the MPtopo database and 930 structures of water-soluble helical proteins obtained from the protein data bank. Large hydrophobic amino acids (Leu, Val, Ile, and Phe) plus Gly were clearly prevalent in TM helices whereas polar amino acids (Glu, Lys, Asp, Arg, and Gln) were less frequent in this type of helix. The distribution of amino acids along TM helices was also examined. As expected, hydrophobic and slightly polar amino acids are commonly found in the hydrophobic core of the membrane whereas aromatic (Trp and Tyr), Pro, and the hydrophilic amino acids (Asn, His, and Gln) occur more frequently in the interface regions. Charged amino acids are also statistically prevalent outside the hydrophobic core of the membrane, and whereas acidic amino acids are frequently found at both cytoplasmic and extra-cytoplasmic interfaces, basic amino acids cluster at the cytoplasmic interface. These results strongly support the experimentally demonstrated biased distribution of positively charged amino acids (that is, the so-called the positive-inside rule) with structural data

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositori d'Objectes Digitals per a l'Ensenyament la Recerca i la Cultura

Accurate and efficient gp120 V3 loop structure based models for the determination of HIV-1 co-receptor usage

Author: AJ Low
B Dau
CB Barber
DR Briggs
DR Kuritzkes
E Frank
EA Berger
EF Pettersen
EM Fenyo
G Wang
GE Crooks
H Scheib
H Schuitemaker
HB Bernstein
HM Berman
II Vaisman
Iosif I Vaisman
JD Rose
JJ De Jong
JU Bowie
M Masso
M Masso
M Masso
M Masso
M Masso
M Masso
M Sharon
MA Jensen
MA Jensen
Majid Masso
MC Prosperi
MJ Sippl
O Sander
P Dorr
S Pillai
S Xu
T Sing
T Watabe
W Resch
WF Vranken
Y Huang
Y Wu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background HIV-1 targets human cells expressing both the CD4 receptor, which binds the viral envelope glycoprotein gp120, as well as either the CCR5 (R5) or CXCR4 (X4) co-receptors, which interact primarily with the third hypervariable loop (V3 loop) of gp120. Determination of HIV-1 affinity for either the R5 or X4 co-receptor on host cells facilitates the inclusion of co-receptor antagonists as a part of patient treatment strategies. A dataset of 1193 distinct gp120 V3 loop peptide sequences (989 R5-utilizing, 204 X4-capable) is utilized to train predictive classifiers based on implementations of random forest, support vector machine, boosted decision tree, and neural network machine learning algorithms. An <it>in silico </it>mutagenesis procedure employing multibody statistical potentials, computational geometry, and threading of variant V3 sequences onto an experimental structure, is used to generate a feature vector representation for each variant whose components measure environmental perturbations at corresponding structural positions. Results Classifier performance is evaluated based on stratified 10-fold cross-validation, stratified dataset splits (2/3 training, 1/3 validation), and leave-one-out cross-validation. Best reported values of sensitivity (85%), specificity (100%), and precision (98%) for predicting X4-capable HIV-1 virus, overall accuracy (97%), Matthew's correlation coefficient (89%), balanced error rate (0.08), and ROC area (0.97) all reach critical thresholds, suggesting that the models outperform six other state-of-the-art methods and come closer to competing with phenotype assays. Conclusions The trained classifiers provide instantaneous and reliable predictions regarding HIV-1 co-receptor usage, requiring only translated V3 loop genotypes as input. Furthermore, the novelty of these computational mutagenesis based predictor attributes distinguishes the models as orthogonal and complementary to previous methods that utilize sequence, structure, and/or evolutionary information. The classifiers are available online at <url>http://proteins.gmu.edu/automute</url>.</p

Springer - Publisher Connector

Fully automated high-quality NMR structure determination of small 2H-enriched proteins

Author: A Bahrami
A Bax
A Bhattacharya
A Grishaev
AT Brunger
CA Rohl
D Zheng
David Baker
DE Zimmerman
F Delaglio
G Cornilescu
G Kontaxis
G Wider
Gaetano T. Montelione
H Schindelin
HN Moseley
JP Linge
JU Bowie
K Newkirk
KH Gardner
KH Gardner
M Suzuki
M Suzuki
M Suzuki
Masayori Inouye
MJ Sippl
Monica J. Roth
NK Goto
RA Laskowski
S Grzesiek
S Raman
S Raman
SC Lovell
Srivatsan Raman
T Yamazaki
TD Goddard
W Feng
William M. Schneider
WM Schneider
X Shan
Y Shen
Yang Shen
YJ Huang
YJ Huang
YS Jung
Yuefeng Tang
Publication venue: Springer Netherlands
Publication date: 01/01/2010
Field of study

Determination of high-quality small protein structures by nuclear magnetic resonance (NMR) methods generally requires acquisition and analysis of an extensive set of structural constraints. The process generally demands extensive backbone and sidechain resonance assignments, and weeks or even months of data collection and interpretation. Here we demonstrate rapid and high-quality protein NMR structure generation using CS-Rosetta with a perdeuterated protein sample made at a significantly reduced cost using new bacterial culture condensation methods. Our strategy provides the basis for a high-throughput approach for routine, rapid, high-quality structure determination of small proteins. As an example, we demonstrate the determination of a high-quality 3D structure of a small 8 kDa protein, E. coli cold shock protein A (CspA), using <4 days of data collection and fully automated data analysis methods together with CS-Rosetta. The resulting CspA structure is highly converged and in excellent agreement with the published crystal structure, with a backbone RMSD value of 0.5 Å, an all atom RMSD value of 1.2 Å to the crystal structure for well-defined regions, and RMSD value of 1.1 Å to crystal structure for core, non-solvent exposed sidechain atoms. Cross validation of the structure with 15N- and 13C-edited NOESY data obtained with a perdeuterated 15N, 13C-enriched 13CH3 methyl protonated CspA sample confirms that essentially all of these independently-interpreted NOE-based constraints are already satisfied in each of the 10 CS-Rosetta structures. By these criteria, the CS-Rosetta structure generated by fully automated analysis of data for a perdeuterated sample provides an accurate structure of CspA. This represents a general approach for rapid, automated structure determination of small proteins by NMR

Springer - Publisher Connector