Search CORE

29 research outputs found

A Crowdsourcing Approach to Developing and Assessing Prediction Algorithms for AML Prognosis

Author: &#379
Abrams Zachary
Ambrosini Giovanna
Anastassiou Dimitris
Baladandayuthapani Veerabhadran
Batten Kimberly
Bisberg Alex J.
Boutros Paul C.
Bucher Philipp
Buturovic Ljubomir
Campion Loic
Chen Gregory M.
Chen Greg
Cheong Jae-Ho
Creighton Chad J.
Di Camillo Barbara
Dreos Ren&#233
Engquist Erik
Estrada Alan
Fatemi Seyyed A.
Fitzgerald Andrew
Flynn Jennifer
Friend Stephen H.
Fronczuk Maciej
Guha Subharup
Hess Kenneth
Hosseini Maryam
Hu Chenyue Wendy
Hung Ling-Hong
Hunter Geoffrey A. M.
Hunter Geoffrey
Hwang Tae Hyun
Jieping Ye
Jinpu Li
Kim Daniel
Kim Minsoo
Kornblau Steven
Korra Jyothi
Krstajic Damjan
Kuh Anthony
Kumar Sunil
Lin Xihui
Liu Li
Liu Yashu
Long Byron L.
Mcmurray James
Morgan Daniel
Motiwala Tasneem
Naegle Kristen
Niemiec Rafa&#322
Norel Raquel
Noren David P.
Norman Thea
Oehler Vivian G.
Park Sunho
Pattin Alejandrina
Peabody Andrea
Piraino Scott W.
Qutub Amina A.
Regan Kelly
Ro&#347
Ronan Tom
Rrhissorrakrai Kahn
Rudnicki Witold
Sanavia Tiziana
Santhanam Narayana
Schultz Andre
Shay Jerry
Stepanov Oleg
Stolovitzky Gustavo
Tang Hao
Vilar Jose M. G.
Wang Tao
Weiyi Gu
Wright Woodring
Wrzesie&#324
Xiao Guanghua
Xie Honglei
Xie Yang
Yang Tai-Hsien Ou
Yang Sen
Yang Tao
Yeung Ka Yee
Zang Xiao
Zolfaghar Kiyana
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

Institutional Research Information System University of Turin

Prediction of overall survival for patients with metastatic castration-resistant prostate cancer : development of a prognostic model through a crowdsourced challenge with open clinical trial data

Author: Abdallah Kald
Abdallah Kald
Airola Antti
Airola Antti
Aittokallio Tero
Aittokallio Tero
Anghe Catalina
Ankerst Donna P
Azima Helia
Baertsch Robert
Ballester Pedro J
Bare Chris
Bare J Christopher
Bhandari Vinayak
Bot Brian M
Bot Brian M
Buchardt Ann-Sophie
Buturovic Ljubomir
Cao Da
Chalise Prabhakar
Chang Billy HW
Cho Junwoo
Chu Tzu-Ming
Coley R Yates
Conjeti Sailesh
Correia Sara
Costello James C
Costello James C
Dai Junqiang
Dai Ziwei
Dang Cuong C
Dargatz Philip
Delavarkhan Sam
Deng Detian
Dhanik Ankur
Du Yu
Dunbar Maria Bekker-Nielsen
Elangovan Aparna
Ellis Shellie
Elo Laura L
Espiritu Shadrielle M
Fan Fan
Farshi Ashkan B
Freitas Ana
Fridley Brooke
Friend Stephen
Friend Stephen
Fuchs Christiane
Gofer Eyal
Golinska Agnieszka K
Graw Stefan
Greiner Russ
Guan Yuanfang
Guinney Justin
Guinney Justin
Guo Jing
Gupta Pankaj
Guyer Anna I
Han Jiawei
Hansen Niels R
Hirvonen Outi
Huang Barbara
Huang Chao
Hwang Jinseub
Ibrahim Joseph G
Jayaswa Vivek
Jeon Jouhyun
Ji Zhicheng
Juvvadi Deekshith
Jyrkkiö Sirkku
Kanigel-Winner Kimberly
Katouzian Amin
Kazanov Marat D
Khan Suleiman A
Khan Suleiman A
Khayyer Shahin
Kim Dalho
Koestler Devin
Kokowicz Fernanda
Kondofersky Ivan
Krautenbacher Norbert
Krstajic Damjan
Kumar Luke
Kurz Christoph
Kyan Matthew
Laajala Teemu D
Laajala Teemu D
Laimighofer Michael
Lee Eunjee
Lesinski Wojciech
Li Miaozhu
Li Ye
Lian Qiuyu
Liang Xiaotao
Lim Minseong
Lin Henry
Lin Xihui
Lu Jing
Mahmoudian Mehrad
Manshaei Roozbeh
Meier Richard
Miljkovic Dejan
Mirtti Tuomas
Mirtti Tuomas
Mnich Krzysztof
Navab Nassir
Neto Elias C
Neto Elias Chaibub
Newton Yulia
Norman Thea
Norman Thea
Pahikkala Tapio
Pahikkala Tapio
Pal Subhabrata
Park Byeongju
Patel Jaykumar
Pathak Swetabh
Pattin Alejandrina
Peddinti Gopal
Peddinti Gopalacharyulu
Peng Jian
Petersen Anne H
Philip Robin
Piccolo Stephen R
Polewko-Klim Aneta
Pölsterl Sebastian
Rao Karthik
Ren Xiang
Rocha Miguel
Rudnicki Witold R.
Ryan Charles J
Ryan Charles J
Ryu Hyunnam
Sartor Oliver
Sartor Oliver
Scher Howard I
Scherb Hagen
Sehgal Raghav
Seyednasrollah Fatemeh
Shang Jingbo
Shao Bin
Shen Liji
Shen Liji
Sher Howard
Shiga Motoki
Sokolov Artem
Song Lei
Soule Howard
Soule Howard
Stolovitzky Gustavo
Stolovitzky Gustavo
Stuart Josh
Sun Ren
Sweeney Christopher J
Sweeney Christopher J
Söllner Julia F
Tahmasebi Nazanin
Tan Kar-Tong
Tomaziu Lisbeth
Usset Joseph
Vang Yeeleng S
Vega Roberto
Vieira Vitor
Wang David
Wang Difei
Wang Junmei
Wang Lichao
Wang Sheng
Wang Tao
Wang Tao
Wang Yue
Winner Kimberly Kanigel
Wolfinger Russ
Wong Chris
Wu Zhenke
Xiao Jinfeng
Xie Xiaohui
Xie Yang
Xie Yang
Xin Doris
Yang Hojin
Yu Nancy
Yu Thomas
Yu Thomas
Yu Xiang
Zahedi Sulmaz
Zanin Massimiliano
Zhang Chihao
Zhang Jingwen
Zhang Shihua
Zhang Yanchun
Zhou Fang Liz
Zhou Fang Liz
Zhu Hongtu
Zhu Shanfeng
Zhu Yuxin
Publication venue
Publication date: 01/01/2016
Field of study

Background Improvements to prognostic models in metastatic castration-resistant prostate cancer have the potential to augment clinical trial design and guide treatment strategies. In partnership with Project Data Sphere, a not-for-profit initiative allowing data from cancer clinical trials to be shared broadly with researchers, we designed an open-data, crowdsourced, DREAM (Dialogue for Reverse Engineering Assessments and Methods) challenge to not only identify a better prognostic model for prediction of survival in patients with metastatic castration-resistant prostate cancer but also engage a community of international data scientists to study this disease. Methods Data from the comparator arms of four phase 3 clinical trials in first-line metastatic castration-resistant prostate cancer were obtained from Project Data Sphere, comprising 476 patients treated with docetaxel and prednisone from the ASCENT2 trial, 526 patients treated with docetaxel, prednisone, and placebo in the MAINSAIL trial, 598 patients treated with docetaxel, prednisone or prednisolone, and placebo in the VENICE trial, and 470 patients treated with docetaxel and placebo in the ENTHUSE 33 trial. Datasets consisting of more than 150 clinical variables were curated centrally, including demographics, laboratory values, medical history, lesion sites, and previous treatments. Data from ASCENT2, MAINSAIL, and VENICE were released publicly to be used as training data to predict the outcome of interest-namely, overall survival. Clinical data were also released for ENTHUSE 33, but data for outcome variables (overall survival and event status) were hidden from the challenge participants so that ENTHUSE 33 could be used for independent validation. Methods were evaluated using the integrated time-dependent area under the curve (iAUC). The reference model, based on eight clinical variables and a penalised Cox proportional-hazards model, was used to compare method performance. Further validation was done using data from a fifth trial-ENTHUSE M1-in which 266 patients with metastatic castration-resistant prostate cancer were treated with placebo alone. Findings 50 independent methods were developed to predict overall survival and were evaluated through the DREAM challenge. The top performer was based on an ensemble of penalised Cox regression models (ePCR), which uniquely identified predictive interaction effects with immune biomarkers and markers of hepatic and renal function. Overall, ePCR outperformed all other methods (iAUC 0.791; Bayes factor >5) and surpassed the reference model (iAUC 0.743; Bayes factor >20). Both the ePCR model and reference models stratified patients in the ENTHUSE 33 trial into high-risk and low-risk groups with significantly different overall survival (ePCR: hazard ratio 3.32, 95% CI 2.39-4.62, p Interpretation Novel prognostic factors were delineated, and the assessment of 50 methods developed by independent international teams establishes a benchmark for development of methods in the future. The results of this effort show that data-sharing, when combined with a crowdsourced challenge, is a robust and powerful framework to develop new prognostic models in advanced prostate cancer.Peer reviewe

Universidade do Minho: RepositoriUM

Crossref

PubMed Central

VTT Research System

Publications at Bielefeld University

Helsingin yliopiston digitaalinen arkisto

Cross-validation pitfalls when selecting and assessing regression and classification models

Author: Damjan Krstajic
David E Leahy
Ljubomir J Buturovic
Simon Thomas
Publication venue: Springer Nature
Publication date: 29/03/2014
Field of study

BACKGROUND: We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. METHODS: We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. RESULTS: We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. CONCLUSIONS: We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1758-2946-6-10) contains supplementary material, which is available to authorized users

Springer - Publisher Connector

PubMed Central

METHODOLOGY Open Access

Author: Damjan Krstajic
David E Leahy
Ljubomir J Buturovic
Simon Thomas
Publication venue
Publication date
Field of study

Cross-validation pitfalls when selecting and assessing regression and classification model

CiteSeerX

The Optimization and Biological Significance of a 29-Host-Immune-mRNA Panel for the Diagnosis of Acute Infections and Sepsis

Author: Eric M. Wohlford
Florian Uhle
Ljubomir Buturovic
Oliver Liesenfeld
Timothy E. Sweeney
Yudong D. He
Publication venue: 'MDPI AG'
Publication date: 01/07/2021
Field of study

In response to the unmet need for timely accurate diagnosis and prognosis of acute infections and sepsis, host-immune-response-based tests are being developed to help clinicians make more informed decisions including prescribing antimicrobials, ordering additional diagnostics, and assigning level of care. One such test (InSep™, Inflammatix, Inc.) uses a 29-mRNA panel to determine the likelihood of bacterial infection, the separate likelihood of viral infection, and the risk of physiologic decompensation (severity of illness). The test, being implemented in a rapid point-of-care platform with a turnaround time of 30 min, enables accurate and rapid diagnostic use at the point of impact. In this report, we provide details on how the 29-biomarker signature was chosen and optimized, together with its molecular, immunological, and medical significance to better understand the pathophysiological relevance of altered gene expression in disease. We synthesize key results obtained from gene-level functional annotations, geneset-level enrichment analysis, pathway-level analysis, and gene-network-level upstream regulator analysis. Emerging findings are summarized as hallmarks on immune cell interaction, inflammatory mediators, cellular metabolism and homeostasis, immune receptors, intracellular signaling and antiviral response; and converging themes on neutrophil degranulation and activation involved in immune response, interferon, and other signaling pathways

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

High Precision Prediction of Functional Sites in Protein Structures

Author: Dragutin Petkovic (535549)
Grace W. Tang (535548)
Ljubomir Buturovic (535547)
Mike Wong (252865)
Russ B. Altman (6158)
Publication venue
Publication date: 14/03/2014
Field of study

<div><p>We address the problem of assigning biological function to solved protein structures. Computational tools play a critical role in identifying potential active sites and informing screening decisions for further lab analysis. A critical parameter in the practical application of computational methods is the precision, or positive predictive value. Precision measures the level of confidence the user should have in a particular computed functional assignment. Low precision annotations lead to futile laboratory investigations and waste scarce research resources. In this paper we describe an advanced version of the protein function annotation system FEATURE, which achieved 99% precision and average recall of 95% across 20 representative functional sites. The system uses a Support Vector Machine classifier operating on the microenvironment of physicochemical features around an amino acid. We also compared performance of our method with state-of-the-art sequence-level annotator Pfam in terms of precision, recall and localization. To our knowledge, no other functional site annotator has been rigorously evaluated against these key criteria. The software and predictive models are incorporated into the WebFEATURE service at <a href="http://feature.stanford.edu/wf4.0-beta" target="_blank">http://feature.stanford.edu/wf4.0-beta</a>.</p></div

CiteSeerX

Directory of Open Access Journals

PubMed Central

FigShare

Functional families used to evaluate performance of FEATURE.

Author: Dragutin Petkovic (535549)
Grace W. Tang (535548)
Ljubomir Buturovic (535547)
Mike Wong (252865)
Russ B. Altman (6158)
Publication venue
Publication date
Field of study

<p>Column PROSITE lists functional families used to evaluate performance of FEATURE. Column Index is index of the conserved position within the corresponding PROSITE regular expression. Column Amino-acid is code of the amino-acid at that position. Column Atom is the residue atom at which the FEATURE microenvironment is centered.</p

FigShare