
79 research outputs found

Prediction of peptide and protein propensity for amyloid formation

Author: A Quintas
A Trovato
A Trovato
AC Davison
AC Tsolis
Alexandre Quintas
AM Fernandez-Escamilla
AP Pawar
AV Finkelstein
B Rost
C Nerelius
Carlos Família
CM Dobson
D Eisenberg
David A. Phoenix
DJ Selkoe
DM Fowler
Eugene A. Permyakov
F Chiti
F Chiti
F Sasagawa
GG Tartaglia
GG Tartaglia
H Hu
I Cherny
I Walsh
IV Baskakov
J Palau
J Tian
JC Rochet
JD Sipe
JM Zimmerman
JW Kelly
JW Kelly
K Rajagopal
KF DuBay
KK Frousios
KT O’Neil
L Goldschmidt
LO Jimenez
M Belli
M Emily
M Hollander
M Kuhn
M López de la Paz
M Oliveberg
M Stefani
M Sunde
M Sunde
M Zamani
MB Kursa
MJ Thompson
MT Pastor
N Becker
N Qian
O Conchillo-Solé
PK Teng
PY Chou
RS Harrison
S Idicula-thomas
S Kawashima
S Kawashima
S Maurer-Stroh
S Ventura
S Yoon
S Yoon
Sarah R. Dennison
SJ Hamodrakas
SJ Hamodrakas
SK Maji
SO Garbuzynskiy
T Hothorn
T Hothorn
T Hothorn
T Scheibel
TPJ Knowles
VS Mathura
WH DePas
WT Astbury
Y Kallberg
Ž Eva
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 09/07/2014

Our understanding of which peptides and proteins have the potential to undergo amyloid formation, and of which driving forces are responsible for amyloid-like fiber formation and stabilization, remains limited. This is mainly because proteins that can undergo structural changes leading to amyloid formation are quite diverse and share no obvious sequence or structural homology, despite the structural similarity found in the fibrils. To address these issues, a novel approach based on recursive feature selection and feed-forward neural networks was undertaken to identify key features highly correlated with the self-assembly problem. This approach allowed the identification of seven physicochemical and biochemical properties of the amino acids highly associated with the self-assembly of peptides and proteins into amyloid-like fibrils (normalized frequency of β-sheet, normalized frequency of β-sheet from LG, weights for β-sheet at the window position of 1, isoelectric point, atom-based hydrophobic moment, helix termination parameter at position j+1 and ΔG° values for peptides extrapolated in 0 M urea). Moreover, these features enabled the development of a new predictor (available at http://cran.r-project.org/web/packages/appnn/index.html) capable of accurately and reliably predicting the amyloidogenic propensity from the polypeptide sequence alone, with a prediction accuracy of 84.9% against an external validation dataset of sequences with experimental in vitro evidence of amyloid formation.

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

FigShare

Derivation of a biomass proxy for dynamic analysis of whole genome metabolic models

Author: B Palsson
C Rohr
D Gilbert
D Gilbert
DC Montgomery
GJ Baart
IH Witten
J Schellenberger
K Smallbone
M Heiner
M Heiner
M Heiner
MA Babyak
MB Kursa
O Hädicke
P Erdrich
R Mahadevan
RM O’Brien
Z King
Publication date: 01/01/2018

A whole genome metabolic model (GEM) is essentially a reconstruction of a network of enzyme-enabled chemical reactions representing the metabolism of an organism, based on information present in its genome. Such models have been designed so that flux balance analysis (FBA) can be applied in order to analyse metabolism under steady state. For this purpose, a biomass function is added to these models as an overall indicator of the model’s viability. Our objective is to develop dynamic models based on these FBA models in order to observe new and complex behaviours, including transient behaviour. There is however a major challenge in that the biomass function does not operate under dynamic simulation. An appropriate biomass function would enable the estimation under dynamic simulation of the growth of both wildtype and genetically modified bacteria under different, possibly dynamically changing growth conditions. Using data analytics techniques, we have developed a dynamic biomass function which acts as a faithful proxy for the FBA equivalent for a reduced GEM for E. coli. This involved consolidating data for reaction rates and metabolite concentrations generated under dynamic simulation with gold standard target data for biomass obtained by steady state analysis using FBA. It also led to a number of interesting insights regarding biomass fluxes for pairs of conditions. These findings were reproduced in our dynamic proxy function.
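
For readers unfamiliar with FBA, the steady-state analysis mentioned in this abstract is conventionally posed as a linear program. The block below is the standard textbook formulation, not something taken from the paper itself, and the symbols are assumptions introduced here: S is the stoichiometric matrix, v the vector of reaction fluxes, c a weight vector that selects the biomass reaction, and v^min, v^max the flux bounds.

```latex
\begin{aligned}
\max_{v}\quad & c^{\top} v \\
\text{subject to}\quad & S\,v = 0, \\
& v^{\min} \le v \le v^{\max}
\end{aligned}
```

The constraint S v = 0 encodes the steady-state assumption (no net accumulation of any metabolite). It is also why the biomass objective does not carry over directly to dynamic simulation, where metabolite concentrations change over time.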

Crossref

Brunel University Research Archive

An experimental study of the intrinsic stability of random forest variable importance measures

Author: A Altmann
A Kalousis
A Statnikov
A Statnikov
A Verikas
AC Haury
AL Boulesteix
AL Boulesteix
CH Park
D Ma
DM Reif
DS Cao
EC Fieller
Fan Yang
H Wang
Huazhen Wang
I Guyon
I Kamkar
J Paul
JM Cadenas
KK Nicodemus
L Breiman
L Hamers
L Yu
L Yu
LI Kuncheva
MB Kursa
ML Calle
O Okun
R Díaz-Uriarte
R Fagin
R Genuer
S Alelyani
S Loscalzo
S Pleus
SS Lee
SY Kim
TK Ho
VY Kulkarni
Y Han
Y Zhang
Z He
Zhiyuan Luo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

BACKGROUND: The stability of Variable Importance Measures (VIMs) based on random forest has recently received increased attention. Despite the extensive attention paid to traditional stability under data perturbations or parameter variations, few studies include influences coming from the intrinsic randomness in generating VIMs, i.e. bagging, randomization and permutation. To address these influences, in this paper we introduce a new concept of intrinsic stability of VIMs, which is defined as the self-consistency among feature rankings in repeated runs of VIMs without data perturbations and parameter variations. Two widely used VIMs, i.e., Mean Decrease Accuracy (MDA) and Mean Decrease Gini (MDG), are comprehensively investigated. The motivation of this study is two-fold. First, we empirically verify the prevalence of intrinsic stability of VIMs over many real-world datasets to highlight that the instability of VIMs does not originate exclusively from data perturbations or parameter variations, but also stems from the intrinsic randomness of VIMs. Second, through Spearman and Pearson tests we comprehensively investigate how different factors influence the intrinsic stability. RESULTS: The experiments are carried out on 19 benchmark datasets with diverse characteristics, including 10 high-dimensional and small-sample gene expression datasets. Experimental results demonstrate the prevalence of intrinsic stability of VIMs. Spearman and Pearson tests on the correlations between intrinsic stability and different factors show that #feature (number of features) and #sample (size of sample) have a coupling effect on the intrinsic stability. The synthetic indicator, #feature/#sample, shows both negative monotonic correlation and negative linear correlation with the intrinsic stability, while OOB accuracy has monotonic correlations with intrinsic stability.
This indicates that high-dimensional, small-sample and high complexity datasets may suffer more from intrinsic instability of VIMs. Furthermore, with respect to parameter settings of random forest, a large number of trees is preferred. No significant correlations can be seen between intrinsic stability and other factors. Finally, the magnitude of intrinsic stability is always smaller than that of traditional stability. CONCLUSION: First, the prevalence of intrinsic stability of VIMs demonstrates that the instability of VIMs not only comes from data perturbations or parameter variations, but also stems from the intrinsic randomness of VIMs. This finding gives a better understanding of VIM stability, and may help reduce the instability of VIMs. Second, by investigating the potential factors of intrinsic stability, users would be more aware of the risks and hence more careful when using VIMs, especially on high-dimensional, small-sample and high complexity datasets.
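
The abstract defines intrinsic stability as the self-consistency among feature rankings across repeated runs on unchanged data. One plausible way to quantify this is the mean pairwise Spearman rank correlation between runs; that choice is an assumption made here for illustration and may not be the exact measure used in the paper. The importance scores below are made-up values, and the function names are hypothetical.

```python
from itertools import combinations
from statistics import mean


def ranks(scores):
    """Rank features by descending importance (1 = most important).

    Assumes there are no tied scores.
    """
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    result = [0] * len(scores)
    for position, feature_index in enumerate(order, start=1):
        result[feature_index] = position
    return result


def spearman(a, b):
    """Spearman rank correlation of two score vectors (no ties assumed)."""
    n = len(a)
    d2 = sum((x - y) ** 2 for x, y in zip(ranks(a), ranks(b)))
    return 1 - 6 * d2 / (n * (n * n - 1))


def intrinsic_stability(runs):
    """Mean pairwise Spearman correlation across repeated importance runs."""
    return mean(spearman(a, b) for a, b in combinations(runs, 2))


# Hypothetical importance scores for 4 features from 3 repeated runs
# on the same data with the same parameters (only the random seed differs).
runs = [
    [0.40, 0.30, 0.20, 0.10],
    [0.38, 0.31, 0.12, 0.19],
    [0.35, 0.36, 0.18, 0.11],
]
stability = intrinsic_stability(runs)
```

A value of 1 would mean every run produced the same ranking; lower values indicate that the ranking changes between runs even though the data and parameters do not.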

Crossref

Springer - Publisher Connector

Royal Holloway - Pure

PubMed Central

Summary of the DREAM8 Parameter Estimation Challenge: Toward Parameter Identification for Whole-Cell Models

Author: Allgood BA
Barker B
Baron M
Bello J
Bernal A
Bogart E
Bot BM
Bryson K
Canas-Duarte SJ
Castro JC
Chandramohan D
Covert MW
Cygan M
DeCicco D
Gomez F
Hoff BR
Hu Y
Huang L
Karr JR
Kazakiewicz D
Kellen MR
Korytkowski P
Kreutz C
Kreutz C
Kursa MB
Leal JMP
Li SC
Li Y
Liu Y
Makadia H
Meyer P
Munoz AR
Plewczynski D
Raue A
Raue A
Restrepo S
Rozo DOB
Shestov AA
Steiert B
Steiert B
Stolovitzky GA
Swistak M
Tang H
Timmer J
Timmer J
Valdes I
Vivas LG
Wang M
Wang T
Wang Y
Wilkinson S
Williams AH
Williams AH
Xiao G
Xie Y
Yang J
Yin A
Zawack K
Zucker JD
Zucker JD
Publication venue: PUBLIC LIBRARY SCIENCE
Publication date: 28/05/2015

Whole-cell models that explicitly represent all cellular components at the molecular level have the potential to predict phenotype from genotype. However, even for simple bacteria, whole-cell models will contain thousands of parameters, many of which are poorly characterized or unknown. New algorithms are needed to estimate these parameters and enable researchers to build increasingly comprehensive models. We organized the Dialogue for Reverse Engineering Assessments and Methods (DREAM) 8 Whole-Cell Parameter Estimation Challenge to develop new parameter estimation algorithms for whole-cell models. We asked participants to identify a subset of parameters of a whole-cell model given the model’s structure and in silico “experimental” data. Here we describe the challenge, the best performing methods, and new insights into the identifiability of whole-cell models. We also describe several valuable lessons we learned toward improving future challenges. Going forward, we believe that collaborative efforts supported by inexpensive cloud computing have the potential to solve whole-cell model parameter estimation.

UCL Discovery

Urinary volatile organic compounds for the detection of prostate cancer

Author: A Roine
A Sreekumar
AD Asimakopoulos
AM Wolf
AW Boots
B Laxman
Ben De Lacy Costello
Chris S. Probert
CK Naughton
CL Silva
CL Silva
CM Willis
D Delen
D Hessels
D Pickel
E Anderssen
E Killick
EA Struys
ES Leman
F Jentzmik
FH Schroder
FK Chun
Franky L Chan
G Horvath
G Lughezzani
G Peng
G Taverna
GA Mills
GL Andriole
H Wu
Huda Al-Kateb
IM Thompson
J Groskopf
JN Cornu
JR Prensner
K Bensalah
KR Elliker
LH Rosenberg
M Kuhn
M Lazzeri
M McCulloch
M Ojala
M Schostak
MB Kursa
MJ Roobol
MK Kwiatkowski
Norman Ratcliffe
P Filzmoser
Paul White
Peter Jones
R Aggio
R Herwig
R Morgan
Raj Persad
Raphael Aggio
S Smith
S Smith
S Zhang
SA Tomlins
Tanzeela Khalid
W Filipiak
WJ Catalona
WJ Catalona
WJ Catalona
WJ Catalona
WR Klecka
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015

© 2015 Khalid et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. The aim of this work was to investigate volatile organic compounds (VOCs) emanating from urine samples to determine whether they can be used to classify samples into those from prostate cancer and non-cancer groups. Participants were men referred for a trans-rectal ultrasound-guided prostate biopsy because of an elevated prostate-specific antigen (PSA) level or abnormal findings on digital rectal examination. Urine samples were collected from patients with prostate cancer (n = 59) and cancer-free controls (n = 43) on the day of their biopsy, prior to their procedure. VOCs from the headspace of basified urine samples were extracted using solid-phase micro-extraction and analysed by gas chromatography/mass spectrometry. Classifiers were developed using Random Forest (RF) and Linear Discriminant Analysis (LDA) classification techniques. PSA alone had an accuracy of 62-64% in these samples. A model based on 4 VOCs, 2,6-dimethyl-7-octen-2-ol, pentanal, 3-octanone, and 2-octanone, was marginally more accurate (63-65%). When combined, PSA level and these four VOCs had mean accuracies of 74% and 65%, using RF and LDA, respectively. With repeated double cross-validation, the mean accuracies fell to 71% and 65%, using RF and LDA, respectively. Results from VOC profiling of urine headspace are encouraging and suggest that there are other metabolomic avenues worth exploring which could help improve the stratification of men at risk of prostate cancer. This study also adds to our knowledge on the profile of compounds found in basified urine, from controls and cancer patients, which is useful information for future studies comparing the urine from patients with other disease states.

Crossref

Directory of Open Access Journals

UWE Bristol Research Repository

PubMed Central

Coventry University Pure Portal

Explore Bristol Research

Seafloor change detection using multibeam echosounder backscatter: case study on the Belgian part of the North Sea

Author: A Liaw
A Rattray
A Singh
AJ Kenny
AK Braimoh
BS Halpern
C Moustier de
D Eleftherakis
D Ierodiaconou
DE Wells
E Verfaillie
Giacomo Montereale-Gavazzi
GM Ashley
GM Foody
GM Foody
GOA Montereale-Gavazzi
J Li
J Spearman
JA Hartigan
JB Jones
JE Hewitt
JE Hughes-Clarke
JS Houziaux
JT Anderson
Koen Degrendele
L Breiman
L Watling
M Diesing
M Diesing
M Hussain
Marc Roche
MB Kursa
Nathan Terseleer
P Holler
PD Denderen van
PJ Luyten
RE Francois
RG Pontius
RG Pontius
RG Pontius
S Degraer
S Leeuwen van
SF Thrush
V Lancker Van
V Lancker Van
V Lancker Van
V Lancker Van
V Lecours
Vera Van Lancker
VL Ferrini
X Lurton
Xavier Lurton
Publication venue: 'Springer Science and Business Media LLC'

Crossref

A Metagenomic Approach to Characterization of the Vaginal Microbiome Signature in Pregnancy

Author: A Chao
A Renyi
Adam J. Ratner
Aleksandar Milosavljevic
B Haas
B Manly
B Tóthmérész
C Favier
C Lozupone
C Quince
C Reinhardt
CA Lozupone
CE Shannon
CF Favier
CJ Brown
Cristian Coarfa
Curtis Huttenhower
D Knights
D Knights
DG Gavin
Dirk Gevers
EG Zoetendal
EK Costello
ES Charlson
FD Ciccarelli
I Letunic
Ignatia Van den Veyver
J Oksanen
J Penders
J Penders
J Qin
J Ravel
J Reeder
James Versalovic
JG Caporaso
Joseph Petrosino
JR Cole
Jun Ma
K Kurokawa
Kevin Riehle
KJ Aagaard
Kjersti Aagaard
KP Riehle
L Dethlefsen
L Qin
LJ Forney
LJ Forney
LV Hooper
M Hamady
M Li
MB Kursa
MG Dominguez-Bello
MG Dominguez-Bello
MM Gronlund
MN Price
N Fierer
N Segata
Nicola Segata
PB Eckburg
PJ Turnbaugh
PJ Turnbaugh
Q Wang
R Kindt
RD Pridmore
RE Ley
RE Ley
RE Ley
RH Whittaker
RI Mackie
Sabeen Raza
Sean Rosenbaum
SR Gill
TK Kim
Toni-Ann Mistretta
WW Oswald
X Xhou
Y Ye
Publication venue: Public Library of Science
Publication date: 13/06/2012

While current major national research efforts (i.e., the NIH Human Microbiome Project) will enable comprehensive metagenomic characterization of the adult human microbiota, how and when these diverse microbial communities take up residence in the host and during reproductive life are unexplored at a population level. Because microbial abundance and diversity might differ in pregnancy, we sought to generate comparative metagenomic signatures across gestational age strata. DNA was isolated from the vagina (introitus, posterior fornix, midvagina) and the V5V3 region of bacterial 16S rRNA genes was sequenced (454FLX Titanium platform). Sixty-eight samples from 24 healthy gravidae (18 to 40 confirmed weeks) were compared with 301 non-pregnant controls (60 subjects). Generated sequence data were quality filtered, taxonomically binned, normalized, and organized by phylogeny and into operational taxonomic units (OTU); principal coordinates analysis (PCoA) of the resultant beta diversity measures was used for visualization and analysis in association with sample clinical metadata. Altogether, 1.4 gigabytes of data containing >2.5 million reads (averaging 6,837 sequences/sample of 493 nt in length) were generated for computational analyses. Although gravidae were not excluded by virtue of a posterior fornix pH >4.5 at the time of screening, a unique vaginal microbiome signature encompassing several specific OTUs and higher-level clades was nevertheless observed and confirmed using a combination of phylogenetic, non-phylogenetic, supervised, and unsupervised approaches. Both overall diversity and richness were reduced in pregnancy, with dominance of Lactobacillus species (L. iners, crispatus, jensenii and johnsonii), and the orders Lactobacillales (and Lactobacillaceae family), Clostridiales, Bacteroidales, and Actinomycetales.
This intergroup comparison using rigorous standardized sampling protocols and analytical methodologies provides robust initial evidence that the vaginal microbial 16S rRNA gene catalogue uniquely differs in pregnancy, with variance of taxa across vaginal subsite and gestational age.
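
The abstract reports that both diversity and richness were reduced in pregnancy, with a few taxa dominating. As a minimal sketch of how such quantities are commonly computed from per-sample OTU counts, the code below uses observed richness and the Shannon index. The abstract does not say which indices the authors used, so that choice is an assumption, and the counts are invented for illustration.

```python
import math


def richness(counts):
    """Observed richness: number of OTUs with at least one read."""
    return sum(1 for c in counts if c > 0)


def shannon(counts):
    """Shannon diversity index (natural log) from raw OTU read counts."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("sample has no reads")
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


# Hypothetical OTU count vectors for two samples with 100 reads each.
even_sample = [25, 25, 25, 25, 0]       # reads spread evenly over 4 OTUs
dominated_sample = [97, 1, 1, 1, 0]     # one OTU dominates the community

even_h = shannon(even_sample)
dominated_h = shannon(dominated_sample)
```

Both samples have the same observed richness (4 OTUs), but the dominated sample has a much lower Shannon index, which is how dominance by a single taxon shows up as reduced diversity.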

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

FigShare