
    Data Requirements for Model-Based Cancer Prognosis Prediction

    Cancer prognosis prediction is typically carried out without integrating scientific knowledge available on genomic pathways, the effect of drugs on cell dynamics, or the distribution of mutations in the population. Recent work addresses some of these problems by formulating an uncertainty class of Boolean regulatory models for abnormal gene regulation, assigning prognosis scores to each network based on intervention outcomes, and partitioning networks in the uncertainty class into prognosis classes based on these scores. For a new patient, the probability distribution of the prognosis class was evaluated using optimal Bayesian classification, given patient data. It was assumed that (1) disease is the result of several mutations of a known healthy network and that these mutations and their probability distribution in the population are known, and (2) only a single snapshot of the patient's gene activity profile is observed. It was shown that, even in ideal settings where cancer in the population and the effect of a drug are fully modeled, a single static measurement is typically not sufficient. Here, we study what measurements are sufficient to predict prognosis. In particular, we relax assumption (1) by addressing how population data may be used to estimate network probabilities, and extend assumption (2) to include static and time-series measurements of both population and patient data. Furthermore, we extend the prediction of prognosis classes to optimal Bayesian regression of prognosis metrics. Although time-series data is preferable for inferring a stochastic dynamical network, we show that static data can be superior for prognosis prediction when constrained to small samples. Finally, although population data is helpful, performance is not sensitive to inaccuracies in the estimated network probabilities.
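
    The core computation described here is Bayesian model averaging over the uncertainty class: given a patient's single static snapshot, the posterior probability of each prognosis class sums the snapshot's steady-state likelihood under every network in that class, weighted by each network's prior probability. The following is a minimal numpy sketch of that step under stated assumptions, not the authors' implementation; the gene-perturbation probability, the toy two-gene networks, and all function names are illustrative.

```python
import numpy as np

def steady_state(T, p=0.01):
    """Stationary distribution of a Boolean network's state-transition
    matrix T (rows sum to 1), with a gene-perturbation probability p
    mixed in so the resulting Markov chain is ergodic."""
    n = T.shape[0]
    P = (1.0 - p) * T + p / n                 # jump to a uniform state w.p. p
    evals, evecs = np.linalg.eig(P.T)
    pi = np.real(evecs[:, np.argmax(np.real(evals))])  # eigenvalue-1 vector
    return pi / pi.sum()

def prognosis_posterior(x, steady_states, net_priors, net_class, n_classes):
    """Posterior P(prognosis class c | one static snapshot x):
    P(c | x) is proportional to the sum over networks theta in class c of
    P(x | theta) * P(theta), where P(x | theta) is theta's steady-state
    probability of the observed state x."""
    post = np.zeros(n_classes)
    for pi_theta, prior, c in zip(steady_states, net_priors, net_class):
        post[c] += pi_theta[x] * prior
    return post / post.sum()

# Toy uncertainty class: two 2-gene networks (4 states), one per class.
T0 = np.array([[0, 1, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 0, 1]], float)
T1 = np.array([[1, 0, 0, 0], [1, 0, 0, 0], [0, 0, 1, 0], [0, 0, 1, 0]], float)
ss = [steady_state(T0), steady_state(T1)]
print(prognosis_posterior(x=1, steady_states=ss,
                          net_priors=[0.5, 0.5], net_class=[0, 1], n_classes=2))
```

    Extending this from classification to the optimal Bayesian regression of prognosis metrics mentioned in the abstract would replace the per-class sum with an expectation of the metric under the same posterior over networks.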

    Modeling the next generation sequencing sample processing pipeline for the purposes of classification

    BACKGROUND: A key goal of systems biology and translational genomics is to utilize high-throughput measurements of cellular states to develop expression-based classifiers for discriminating among different phenotypes. Recent developments in Next Generation Sequencing (NGS) technologies can facilitate classifier design by providing expression measurements for tens of thousands of genes simultaneously via the abundance of their mRNA transcripts. Because NGS technologies impose a nonlinear transformation on the actual expression distributions, their application can yield data that are less discriminative than the actual expression levels themselves would be, were they directly observable. RESULTS: Using state-of-the-art distributional modeling for the NGS processing pipeline, this paper studies how that pipeline, via the resulting nonlinear transformation, affects classification and feature selection. The effects of different factors are considered, and NGS-based classification is compared to SAGE-based classification and to classification directly on the raw expression data, which is represented by a very high-dimensional model previously developed for gene expression. As expected, the nonlinear transformation resulting from NGS processing diminishes classification accuracy; however, owing to a larger number of reads, NGS-based classification outperforms SAGE-based classification. CONCLUSIONS: Having high numbers of reads can mitigate the degradation in classification performance resulting from the effects of NGS technologies. Hence, when performing an RNA-Seq analysis, using the highest possible coverage of the genome is recommended for the purposes of classification.
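
    To make the nonlinear-transformation point concrete, the sketch below simulates the sequencing step as multinomial read sampling from transcript concentrations and compares classification error on raw expression versus read counts at low and high coverage. This is an illustrative stand-in, not the paper's distributional model: the Gaussian log-expression model, the nearest-centroid classifier, and the chosen depths are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate(n, n_genes=20, shift=1.0):
    """Two phenotypes: class-1 samples have their mean log-expression
    shifted on the first few genes (a toy stand-in for the paper's
    high-dimensional gene-expression model)."""
    y = rng.integers(0, 2, n)
    mu = np.zeros((2, n_genes))
    mu[1, :5] = shift
    logexpr = mu[y] + rng.normal(0.0, 1.0, (n, n_genes))
    return logexpr, y

def sequence(logexpr, depth):
    """NGS pipeline as a nonlinear transformation: true expression ->
    relative transcript concentrations -> multinomial read counts at
    a fixed sequencing depth."""
    conc = np.exp(logexpr)
    p = conc / conc.sum(axis=1, keepdims=True)
    return np.array([rng.multinomial(depth, pi) for pi in p], float)

def centroid_error(Xtr, ytr, Xte, yte):
    """Nearest-centroid classifier: a simple proxy for the classifiers
    studied in the paper."""
    c0, c1 = Xtr[ytr == 0].mean(0), Xtr[ytr == 1].mean(0)
    pred = (np.linalg.norm(Xte - c1, axis=1)
            < np.linalg.norm(Xte - c0, axis=1)).astype(int)
    return (pred != yte).mean()

Xtr, ytr = simulate(100)
Xte, yte = simulate(2000)
print("raw expression:", centroid_error(Xtr, ytr, Xte, yte))
for depth in (100, 10_000):                  # low vs. high coverage
    Ctr = np.log1p(sequence(Xtr, depth))     # log-transformed read counts
    Cte = np.log1p(sequence(Xte, depth))
    print(f"reads, depth={depth:>6}:", centroid_error(Ctr, ytr, Cte, yte))
```

    In this toy setup, the multinomial sampling noise shrinks as depth grows, so classification error on counts approaches the error on the (normalized) expression levels, mirroring the conclusion that high read counts mitigate the degradation introduced by the pipeline.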