
28 research outputs found

progenyClust: an R package for Progeny Clustering

Author: Hu Chenyue W.
Qutub Amina A.
Publication date: 01/01/2016

Identifying the optimal number of clusters is a common problem faced by data scientists in various research fields and industry applications. Though many clustering evaluation techniques have been developed to solve this problem, the recently developed algorithm Progeny Clustering is a much faster alternative and one that is relevant to biomedical applications. In this paper, we introduce an R package, progenyClust, that implements and extends the original Progeny Clustering algorithm for evaluating clustering stability and identifying the optimal cluster number. We illustrate its applicability using two examples: a simulated test dataset for proof-of-concept, and a cell imaging dataset for demonstrating its application potential in biomedical research. The progenyClust package is versatile in that it offers great flexibility for picking methods and tuning parameters. In addition, the default parameter setting as well as the plot and summary methods offered in the package make the application of Progeny Clustering straightforward and coherent.

DSpace at Rice University

Proteomics in Acute Myeloid Leukemia

Author: Hu Chenyue W.
Qutub Amina A.
Publication venue: 'IntechOpen'
Publication date: 20/12/2017

Acute myeloid leukemia (AML) is an extremely heterogeneous and deadly hematological cancer. Cytogenetic abnormalities and genetic mutations, though well recognized and highly prognostic, do not fully capture the degree of heterogeneity manifested in AML clinically. Additionally, current treatment of AML still largely depends on chemotherapy and allogeneic stem cell transplantation, with few options for personalized and molecularly targeted therapies. Proteomics holds promise for unraveling biological heterogeneities in AML beyond the scope of cytogenetics and genomics. In recent years, proteomics has emerged as an important tool for discovering new diagnostic biomarkers, enabling more prognostic patient classifications, and identifying novel therapeutic targets. In this chapter, we review recent advances in proteomic studies of AML, including an overview of AML pathology, popular proteomic techniques, various applications of proteomics in AML from biomarker discovery to target identification, and challenges and future directions in this field.

IntechOpen

Crossref

Shrinkage Clustering: A Fast and Size-Constrained Algorithm for Biomedical Applications

Author: Hu Chenyue W.
Li Hanyang
Qutub Amina A.
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 17th International Workshop on Algorithms in Bioinformatics (WABI 2017)
Publication date: 01/01/2017

Motivation: Many common clustering algorithms require a two-step process that limits their efficiency. The algorithms need to be performed repetitively and need to be implemented together with a model selection criterion, in order to determine both the number of clusters present in the data and the corresponding cluster memberships. As biomedical datasets increase in size and prevalence, there is a growing need for new methods that are more convenient to implement and are more computationally efficient. In addition, it is often essential to obtain clusters of sufficient sample size to make the clustering result meaningful and interpretable for subsequent analysis. Results: We introduce Shrinkage Clustering, a novel clustering algorithm based on matrix factorization that simultaneously finds the optimal number of clusters while partitioning the data. We report its performance across multiple simulated and actual datasets, and demonstrate its strength in accuracy and speed in application to subtyping cancer and brain tissues. In addition, the algorithm offers a straightforward solution to clustering with cluster size constraints. Given its ease of implementation, computing efficiency and extensible structure, we believe Shrinkage Clustering can be applied broadly to solve biomedical clustering tasks, especially when dealing with large datasets.

Dagstuhl Research Online Publication Server

Clinical relevance of proteomic profiling in de novo pediatric acute myeloid leukemia: a Children’s Oncology Group study

Author: Alonzo Todd A.
Aplenc Richard
de Bont Eveline S.J.M.
Gamis Alan S.
Gerbing Robert B.
Hoff Fieke W.
Horton Terzah M.
Hu Chenyue W.
Jenkins Gaye N.
Kolb E. Anders
Kornblau Steven M.
Ligeralde Andrew
Meshinchi Soheil
Qiu Yihua
Qutub Amina A.
Ries Rhonda E.
van Dijk Anneke D.
Publication venue: 'Ferrata Storti Foundation (Haematologica)'
Publication date: 13/01/2022

Pediatric acute myeloid leukemia (AML) remains a fatal disease for at least 30% of patients, stressing the need for improved therapies and better risk stratification. As proteins are the unifying feature of (epi)genetic and environmental alterations, and are often targeted by novel chemotherapeutic agents, we studied the proteomic landscape of pediatric AML. Protein expression and activation levels were measured in 500 bulk leukemic patient samples and 30 control CD34(+) cell samples, using reverse phase protein arrays with 296 strictly validated antibodies. The multistep MetaGalaxy analysis methodology was applied and identified nine protein expression signatures (PrSIG), based on strong recurrent protein expression patterns. PrSIG were associated with cytogenetics and mutational state, and with favorable or unfavorable prognosis. Analysis based on treatment (i.e., ADE vs. ADE plus bortezomib) identified three PrSIG that did better with ADE plus bortezomib than with ADE alone. When PrSIG were studied in the context of cytogenetic risk groups, PrSIG were independently prognostic after multivariate analysis, suggesting a potential value for proteomics in combination with current classification systems. Proteins with universally increased (n=7) or decreased (n=17) expression were observed across PrSIG. Certain proteins significantly differentially expressed from normal could be identified, forming a hypothetical platform for personalized medicine.

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

PubMed Central

Dissertations of the University of Groningen

Inferring causal molecular networks: empirical assessment through a community-based effort.

Author: A de la Fuente
AA Margolin
Adrian Bivol
Alexander J Bisberg
Alexander V Favorov
Amina A Qutub
Artem Sokolov
Bahman Afsari
BT Hennessy
Byron L Long
C Olsen
Chenyue W Hu
Chris K Wong
CM Chresta
D Freedman
D Husmeier
D Marbach
Dane Taylor
Daniel E Carlin
David P Noren
EG Cerami
EJ Molinelli
Elana J Fertig
Evan O Paull
F Eduati
F Markowetz
Fan Zhu
G Stolovitzky
Gordon B Mills
Gustavo Stolovitzky
H Wang
Haizhou Wang
Heinz Koeppl
I Cantone
J Barretina
J Saez-Rodriguez
JC Costello
JMJ Derry
Joe W Gray
Joshua M Stuart
Julio Saez-Rodriguez
K Sachs
Kiley Graim
Laura M Heiser
Ludmila V Danilova
M Bansal
M Hecker
MH Maathuis
Michael Kellen
Michael Unger
Mingzhou Song
MJ Garnett
N Friedman
Nicole K Nesser
O Guitart-Pla
P Mertins
P Meyer
P Shannon
Paul T Spellman
R Akbani
R De Smet
R Tibes
RJ Prill
RM Neve
Sach Mukherjee
SM Hill
SR Maetschke
Stephen Friend
Steven M Hill
T Cokelaer
T Ideker
Thea Norman
Thomas Cokelaer
Wai Shing Lee
WW Chen
Y Benjamini
Yang Zhang
Yuanfang Guan
Publication venue: Nat Methods
Publication date: 01/01/2015

It remains unclear whether causal, rather than merely correlational, relationships in molecular networks can be inferred in complex biological settings. Here we describe the HPN-DREAM network inference challenge, which focused on learning causal influences in signaling networks. We used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model. Using the phosphoprotein data, we scored more than 2,000 networks submitted by challenge participants. The networks spanned 32 biological contexts and were scored in terms of causal validity with respect to unseen interventional data. A number of approaches were effective, and incorporating known biology was generally advantageous. Additional sub-challenges considered time-course prediction and visualization. Our results suggest that learning causal relationships may be feasible in complex settings such as disease states. Furthermore, our scoring approach provides a practical way to empirically assess inferred molecular networks in a causal sense.

TUbiblio

Crossref

PubMed Central

eScholarship - University of California

Warwick Research Archives Portal Repository

Apollo (Cambridge)

DSpace at Rice University

A Crowdsourcing Approach to Developing and Assessing Prediction Algorithms for AML Prognosis

Author: Ż
Abrams Zachary
Ambrosini Giovanna
Anastassiou Dimitris
Baladandayuthapani Veerabhadran
Batten Kimberly
Bisberg Alex J.
Boutros Paul C.
Bucher Philipp
Buturovic Ljubomir
Campion Loic
Chen Gregory M.
Chen Greg
Cheong Jae-Ho
Creighton Chad J.
Di Camillo Barbara
Dreos René
Engquist Erik
Estrada Alan
Fatemi Seyyed A.
Fitzgerald Andrew
Flynn Jennifer
Friend Stephen H.
Fronczuk Maciej
Guha Subharup
Hess Kenneth
Hosseini Maryam
Hu Chenyue Wendy
Hung Ling-Hong
Hunter Geoffrey A. M.
Hunter Geoffrey
Hwang Tae Hyun
Jieping Ye
Jinpu Li
Kim Daniel
Kim Minsoo
Kornblau Steven
Korra Jyothi
Krstajic Damjan
Kuh Anthony
Kumar Sunil
Lin Xihui
Liu Li
Liu Yashu
Long Byron L.
Mcmurray James
Morgan Daniel
Motiwala Tasneem
Naegle Kristen
Niemiec Rafał
Norel Raquel
Noren David P.
Norman Thea
Oehler Vivian G.
Park Sunho
Pattin Alejandrina
Peabody Andrea
Piraino Scott W.
Qutub Amina A.
Regan Kelly
Roś
Ronan Tom
Rrhissorrakrai Kahn
Rudnicki Witold
Sanavia Tiziana
Santhanam Narayana
Schultz Andre
Shay Jerry
Stepanov Oleg
Stolovitzky Gustavo
Tang Hao
Vilar Jose M. G.
Wang Tao
Weiyi Gu
Wright Woodring
Wrzesień
Xiao Guanghua
Xie Honglei
Xie Yang
Yang Tai-Hsien Ou
Yang Sen
Yang Tao
Yeung Ka Yee
Zang Xiao
Zolfaghar Kiyana
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016

Institutional Research Information System University of Turin

Inferring causal molecular networks: empirical assessment through a community-based effort

Author: Afsari Bahman
Al-Ouran Rami
Anton Bernat
Arodz Tomasz
Bagheri Neda
Berlow Noah
Bisberg Alexander J.
Bivol Adrian
Bohler Anwesha
Bonet Jaume
Bonneau Richard
Budak Gungor
Bunescu Razvan
Caglar Mehmet
Cai Binghuang
Cai Chunhui
Carlin Daniel E.
Carlon Azzurra
Chen Lujia
Ciaccio Mark F.
Cokelaer Thomas
Cooper Gregory
Coort Susan
Creighton Chad J.
Daneshmand Seyed-Mohammad-Hadi
Danilova Ludmila V.
De La Fuente Alberto
Di Camillo Barbara
Dutta-Moscato Joyeeta
Emmett Kevin
Evelo Chris
Fassia Mohammad-Kasim H.
Favorov Alexander V.
Fertig Elana J.
Finkle Justin D.
Finotello Francesca
Friend Stephen
Gao Jean
Gao Xi
Ghosh Samik
Giaretta Alberto
Graim Kiley
Gray Joe W.
Großeholz Ruth
Guan Yuanfang
Guinney Justin
Hafemeister Christoph
Hahn Oliver
Haider Saad
Hase Takeshi
Heiser Laura M.
Hill Steven M.
Hodgson Jay
Hoff Bruce
Hsu Chih Hao
Hu Chenyue W.
Hu Ying
Huang Xun
Jalili Mahdi
Jiang Xia
Kacprowski Tim
Kaderali Lars
Kang Mingon
Kannan Venkateshan
Kellen Michael
Kikuchi Kaito
Kim Dong-Chul
Kitano Hiroaki
Knapp Bettina
Koeppl Heinz
Komatsoulis George
Krämer Andreas
Kursa Miron Bartosz
Kutmon Martina
Lee Wai Shing
Li Yichao
Liang Xiaoyu
Linger Michael
Liu Yu
Liu Zhaoqi
Long Byron L.
Lu Songjian
Lu Xinghua
Manfrini Marco
Matos Marta R. A.
Meerzaman Daoud
Mills Gordon B.
Min Wenwen
Mukherjee Sach
Müller Christian Lorenz
Neapolitan Richard E.
Nesser Nicole K.
Noren David P.
Norman Thea
Oliva Baldo
Opiyo Stephen Obol
Pal Ranadip
Palinkas Aljoscha
Paull Evan O.
Planas-Iglesias Joan
Poglayen Daniel
Qutub Amina A.
Saez-Rodriguez Julio
Sambo Francesco
Sanavia Tiziana
Sharifi-Zarchi Ali
Sichani Omid Askari
Slawek Janusz
Sokolov Artem
Song Mingzhou
Spellman Paul T.
Stolovitzky Gustavo
Streck Adam
Strunz Sonja
Stuart Joshua M.
Taylor Dane
Tegnér Jesper
Thobe Kirste
Toffolo Gianna Maria
Trifoglio Emanuele
Unger Michael
Wan Qian
Wang Haizhou
Welch Lonnie
Wong Chris K.
Wu Jia J.
Xue Albert Y.
Yamanaka Ryota
Yan Chunhua
Zairis Sakellarios
Zengerling Michael
Zenil Hector
Zhang Yang
Zhu Fan
Zi Zhike
Publication date: 01/01/2016

Inferring molecular networks is a central challenge in computational biology. However, it has remained unclear whether causal, rather than merely correlational, relationships can be effectively inferred in complex biological settings. Here we describe the HPN-DREAM network inference challenge that focused on learning causal influences in signaling networks. We used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model. Using the phosphoprotein data, we scored more than 2,000 networks submitted by challenge participants. The networks spanned 32 biological contexts and were scored in terms of causal validity with respect to unseen interventional data. A number of approaches were effective, and incorporating known biology was generally advantageous. Additional sub-challenges considered time-course prediction and visualization. Our results constitute the most comprehensive assessment of causal network inference in a mammalian setting carried out to date and suggest that learning causal relationships may be feasible in complex settings such as disease states. Furthermore, our scoring approach provides a practical way to empirically assess the causal validity of inferred molecular networks.

Carolina Digital Repository

Inferring causal molecular networks: empirical assessment through a community-based effort

Author: Adam Streck
Afsari Bahman
Albert Y. Xue
Alberto de la Fuente
Ali Sharifi Zarchi
Aljoscha Palinkas
Andreas Krämer
Anwesha Bohler
Azzurra Carlon
Baldo Oliva
Bernat Anton
Bettina Knapp
Binghuang Cai
Bisberg Alexander J
Bivol Adrian
Bruce Hoff
Carlin Daniel E
Chad J. Creighton
Chenyue W Hu
Chih Hao Hsu
Chris Evelo
Christian Lorenz Müller
Christoph Hafemeister
Chunhua Yan
Chunhui Cai
Cokelaer Thomas
Daniel Poglayen
Danilova Ludmila V
Daoud Meerzaman
Di Camillo Barbara
Dong Chul Kim
Joyeeta Dutta Moscato
Favorov Alexander V
Fertig Elana J
Finotello Francesca
Friend Stephen
Gao Xi
George Komatsoulis
Giaretta Alberto
Graim Kiley
Gray Joe W
Gregory Cooper
Guan Yuanfang
Gungor Budak
Hector Zenil
Heiser Laura M
Hill Steven M
Hiroaki Kitano
HPN-DREAM Consortium
Rami Al Ouran
Janusz Slawek
Jaume Bonet
Javier Garcia Garcia
Jay Hodgson
Jean Gao
Jesper Tegnér
Jia J. Wu
Joan Planas Iglesias
Justin D Finkle
Justin Guinney
Kaito Kikuchi
Kellen Michael
Kevin Emmett
Kirste Thobe
Koeppl Heinz
Lars Kaderali
Lee Wai Shing
Liu Yu
Long Byron L
Lonnie Welch
Lujia Chen
Mahdi Jalili
Manfrini Marco
Mark F. Ciaccio
Marta R. A Matos
Martina Kutmon
Mehmet Caglar
Michael Zengerling
Mills Gordon B
Mingon Kang
Miron Bartosz Kursa
Mohammad Kasim H. Fassia
Mukherjee Sach
Neda Bagheri
Nesser Nicole K
Noah Berlow
Noren David P
Norman Thea
Oliver Hahn
Omid Askari Sichani
Paull Evan O
Qian Wan
Qutub Amina A
Ranadip Pal
Razvan Bunescu
Richard E Neapolitan
Richard Bonneau
Ruth Großeholz
Ryota Yamanaka
Saad Haider
Saez Rodriguez Julio
Sakellarios Zairis
Sambo Francesco
Samik Ghosh
Sanavia Tiziana
Seyed Mohammad Hadi Daneshmand
Shihua Zhang
Sokolov Artem
Song Mingzhou
Songjian Lu
Sonja Strunz
Spellman Paul T
Stephen Obol Opiyo
Stolovitzky Gustavo
Stuart Joshua M
Susan Coort
Takeshi Hase
Taylor Dane
Tim Kacprowski
Toffolo Gianna Maria
Tomasz Arodz
Trifoglio Emanuele
Unger Michael
Venkateshan Kannan
Wang Haizhou
Wenwen Min
Wong Chris K
Xia Jiang
Xiaoyu Liang
Xinghua Lu
Xun Huang
Yichao Li
Ying Hu
Zhang Yang
Zhaoqi Liu
Zhike Zi
Zhu Fan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Institutional Research Information System University of Turin

Archivio istituzionale della ricerca - Università di Padova

Shrinkage Clustering: a fast and size-constrained clustering algorithm for biomedical applications

Author: Amina A. Qutub
Chenyue W. Hu
Hanyang Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

Background: Many common clustering algorithms require a two-step process that limits their efficiency. The algorithms need to be performed repetitively and need to be implemented together with a model selection criterion. These two steps are needed in order to determine both the number of clusters present in the data and the corresponding cluster memberships. As biomedical datasets increase in size and prevalence, there is a growing need for new methods that are more convenient to implement and are more computationally efficient. In addition, it is often essential to obtain clusters of sufficient sample size to make the clustering result meaningful and interpretable for subsequent analysis. Results: We introduce Shrinkage Clustering, a novel clustering algorithm based on matrix factorization that simultaneously finds the optimal number of clusters while partitioning the data. We report its performances across multiple simulated and actual datasets, and demonstrate its strength in accuracy and speed applied to subtyping cancer and brain tissues. In addition, the algorithm offers a straightforward solution to clustering with cluster size constraints. Conclusions: Given its ease of implementation, computing efficiency and extensible structure, Shrinkage Clustering can be applied broadly to solve biomedical clustering tasks especially when dealing with large datasets.

Directory of Open Access Journals

DSpace at Rice University

Progeny Clustering: A Method to Identify Biological Phenotypes

Author: Hu Chenyue W.
Kornblau Steven M.
Qutub Amina A.
Slater John H.
Publication date: 12/08/2015

Estimating the optimal number of clusters is a major challenge in applying cluster analysis to any type of dataset, especially to biomedical datasets, which are high-dimensional and complex. Here, we introduce an improved method, Progeny Clustering, which is stability-based and exceptionally efficient in computing, to find the ideal number of clusters. The algorithm employs a novel Progeny Sampling method to reconstruct cluster identity, a co-occurrence probability matrix to assess the clustering stability, and a set of reference datasets to overcome inherent biases in the algorithm and data space. Our method proved successful and robust when applied to two synthetic datasets (a two-dimensional dataset and a ten-dimensional dataset containing eight dimensions of pure noise), two standard biological datasets (the Iris dataset and Rat CNS dataset) and two biological datasets (a cell phenotype dataset and an acute myeloid leukemia (AML) reverse phase protein array (RPPA) dataset). Progeny Clustering outperformed some popular clustering evaluation methods in the ten-dimensional synthetic dataset as well as in the cell phenotype dataset, and it was the only method that successfully discovered clinically meaningful patient groupings in the AML RPPA dataset.

PubMed Central

University of Delaware Library Institutional Repository

DSpace at Rice University