9 research outputs found
Systematic analysis of challenge-driven improvements in molecular prognostic models for breast cancer
Although molecular prognostics in breast cancer are among the most successful examples of translating genomic analysis to clinical applications, optimal approaches to breast cancer clinical risk prediction remain controversial. The Sage Bionetworks-DREAM Breast Cancer Prognosis Challenge (BCC) is a crowdsourced research study for breast cancer prognostic modeling using genome-scale data. The BCC provided a community of data analysts with a common platform for data access and blinded evaluation of model accuracy in predicting breast cancer survival on the basis of gene expression data, copy number data, and clinical covariates. This approach offered the opportunity to assess whether a crowdsourced community Challenge would generate models of breast cancer prognosis commensurate with or exceeding current best-in-class approaches. The BCC comprised multiple rounds of blinded evaluations on held-out portions of data on 1981 patients, resulting in more than 1400 models submitted as open source code. Participants then retrained their models on the full data set of 1981 samples and submitted up to five models for validation in a newly generated data set of 184 breast cancer patients. Analysis of the BCC results suggests that the best-performing modeling strategy outperformed previously reported methods in blinded evaluations; model performance was consistent across several independent evaluations; and aggregating community-developed models achieved performance on par with the best-performing individual models. Copyright 2013 by the American Association for the Advancement of Science; all rights reserve
Systematic Analysis of Challenge-Driven Improvements in Molecular Prognostic Models for Breast Cancer
Although molecular prognostics in breast cancer are among the most successful examples of translating genomic analysis to clinical applications, optimal approaches to breast cancer clinical risk prediction remain controversial. The Sage Bionetworks–DREAM Breast Cancer Prognosis Challenge (BCC) is a crowdsourced research study for breast cancer prognostic modeling using genome-scale data. The BCC provided a community of data analysts with a common platform for data access and blinded evaluation of model accuracy in predicting breast cancer survival on the basis of gene expression data, copy number data, and clinical covariates. This approach offered the opportunity to assess whether a crowdsourced community Challenge would generate models of breast cancer prognosis commensurate with or exceeding current best-in-class approaches. The BCC comprised multiple rounds of blinded evaluations on held-out portions of data on 1981 patients, resulting in more than 1400 models submitted as open source code. Participants then retrained their models on the full data set of 1981 samples and submitted up to five models for validation in a newly generated data set of 184 breast cancer patients. Analysis of the BCC results suggests that the best-performing modeling strategy outperformed previously reported methods in blinded evaluations; model performance was consistent across several independent evaluations; and aggregating community-developed models achieved performance on par with the best-performing individual models
Energy Landscapes, Folding Mechanisms, and Kinetics of RNA Tetraloop Hairpins
This is the accepted manuscript for a paper published in Journal of the American Chemical Society, 2014, 136 (52), pp 18052?18061 DOI: 10.1021/ja5100756RNA hairpins play a pivotal role in a diverse range of cellular functions, and are integral components of ribozymes, mRNA, and riboswitches. However, the mechanistic and kinetic details of RNA hairpin folding, which are key determinants of most of its biological functions, are poorly understood. In this work, we use the discrete path sampling (DPS) approach to explore the energy landscapes of two RNA tetraloop hairpins, and provide insights into their folding mechanisms and kinetics in atomistic detail. Our results show that the potential energy landscapes have a distinct funnel-like bias toward the folded hairpin state, consistent with efficient structure-seeking properties. Mechanistic and kinetic information is analyzed in terms of kinetic transition networks. We find microsecond folding times, consistent with temperature jump experiments, for hairpin folding initiated from relatively compact unfolded states. This process is essentially driven by an initial collapse, followed by rapid zippering of the helix stem in the final phase. Much lower folding rates are predicted when the folding is initiated from extended chains, which undergo longer excursions on the energy landscape before nucleation events can occur. Our work therefore explains recent experiments and coarse-grained simulations, where the folding kinetics exhibit precisely this dependency on the initial conditions.We are grateful to Dr. David de Sancho, Dr. Yassmine Chebaro,\ud
Dr. Guillem Portella, Dr. Chris Whittleston, and Dr. Joanne M.\ud
Carr for helpful discussions. We also thank Mr. Boris Fackovec\ud
for his comments on an initial version of the manuscript. The\ud
work was financially supported by the ERC. D.C. gratefully\ud
acknowledges the Cambridge Commonwealth, European and\ud
International Trust for financial support