
INRIA a CCSD electronic archive server

HAL Descartes

Recommended from our members

iBBiG: iterative binary bi-clustering of gene sets

Author: Bentink Stefan
Culhane Aedín C.
Gusenleitner Daniel
Howe Eleanor A.
Quackenbush John
Publication venue: 'Oxford University Press (OUP)'
Publication date: 24/04/2013

Motivation: Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration, is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods. Results: We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set-phenotype associations that predicted tumor metastases within tumor subtypes.
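
The discretization step mentioned in the abstract (turning enrichment results into binary gene set profiles) can be sketched roughly as below. This is a minimal illustration, not the iBBiG implementation: the function name `binarize_enrichment`, the 0.05 cutoff and the toy p-values are all assumptions made for the example.

```python
def binarize_enrichment(pvalues, cutoff=0.05):
    """Convert a gene-set x phenotype table of enrichment p-values into 0/1 calls.

    `cutoff` is an illustrative assumption, not a value taken from the paper.
    """
    return [[1 if p < cutoff else 0 for p in row] for row in pvalues]


# Hypothetical p-values: rows are gene sets, columns are phenotype comparisons.
pvalues = [
    [0.001, 0.20, 0.03],
    [0.40, 0.04, 0.90],
]
binary = binarize_enrichment(pvalues)
print(binary)
```

A bi-clustering method would then look for groups of rows (gene sets) that share 1s across the same group of columns (phenotypes).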

A multivariate approach to the integration of multi-omics datasets

Author: Aedín C Culhane
Amin Moghaddas Gholami
Bernhard Kuster
Chen Meng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Background: To leverage the potential of multi-omics studies, exploratory data analysis methods that provide systematic integration and comparison of multiple layers of omics information are required. We describe multiple co-inertia analysis (MCIA), an exploratory data analysis method that identifies co-relationships between multiple high dimensional datasets. Based on a covariance optimization criterion, MCIA simultaneously projects several datasets into the same dimensional space, transforming diverse sets of features onto the same scale, to extract the most variant features from each dataset and facilitate biological interpretation and pathway analysis. Results: We demonstrate integration of multiple layers of information using MCIA, applied to two typical “omics” research scenarios. The integration of transcriptome and proteome profiles of cells in the NCI-60 cancer cell line panel revealed distinct, complementary features, which together increased the coverage and power of pathway analysis. Our analysis highlighted the importance of the leukemia extravasation signaling pathway in leukemia, which was not highly ranked in the analysis of any individual dataset. Secondly, we compared transcriptome profiles of high grade serous ovarian tumors that were obtained on two different microarray platforms and by next generation RNA-sequencing, to identify the most informative platform and extract robust biomarkers of molecular subtypes. We discovered that RNA-sequencing data processed using RPKM had greater variance than data processed with MapSplice and RSEM. We provided novel markers highly associated with tumor molecular subtype, combined from four data platforms. MCIA is implemented and available in the R/Bioconductor “omicade4” package. Conclusion: We believe MCIA is an attractive method for data integration and visualization of several datasets of multi-omics features observed on the same set of individuals.
The method is not dependent on feature annotation, and thus it can extract important features even when they are not present across all datasets. MCIA provides simple graphical representations for the identification of relationships between large datasets.

Springer - Publisher Connector

Springer


Stem Cell-Like Gene Expression in Ovarian Cancer Predicts Type II Subtype and Prognosis

Author: Bentink Stefan
Culhane Aedín C.
Haibe-Kains Benjamin
Harrington David Paul
Hofmann Oliver Marc
Quackenbush John
Schwede Matthew
Spentzos Dimitrios
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 11/03/2013

Although ovarian cancer is often initially chemotherapy-sensitive, the vast majority of tumors eventually relapse and patients die of increasingly aggressive disease. Cancer stem cells are believed to have properties that allow them to survive therapy and may drive recurrent tumor growth. Cancer stem cells or cancer-initiating cells are a rare cell population and difficult to isolate experimentally. Genes that are expressed by stem cells may characterize a subset of less differentiated tumors and aid in prognostic classification of ovarian cancer. The purpose of this study was the genomic identification and characterization of a subtype of ovarian cancer that has stem cell-like gene expression. Using human and mouse gene signatures of embryonic, adult, or cancer stem cells, we performed an unsupervised bipartition class discovery on expression profiles from 145 serous ovarian tumors to identify a stem-like and a more differentiated subgroup. Subtypes were reproducible and were further characterized in four independent, heterogeneous ovarian cancer datasets. We identified a stem-like subtype characterized by a 51-gene signature, which is significantly enriched in tumors with properties of Type II ovarian cancer: high grade, serous tumors, and poor survival. Conversely, the differentiated tumors share properties with Type I, including lower grade and mixed histological subtypes. The stem cell-like signature was prognostic within high-stage serous ovarian cancer, classifying a small subset of high-stage tumors with better prognosis in the differentiated subtype. In multivariate models that adjusted for common clinical factors (including grade, stage, age), the subtype classification was still a significant predictor of relapse.
The prognostic stem-like gene signature yields new insights into prognostic differences in ovarian cancer, provides a genomic context for defining Type I/II subtypes, and identifies potential gene targets which, following further validation, may be valuable in the clinical management or treatment of ovarian cancer.

University of Melbourne Institutional Repository

FigShare

GeneSigDB—a curated database of gene expression signatures

Author: Correll Mick
Culhane Aedín C.
Franklin Katherine R.
French Simon J.
Lu Tim H.
Papenhausen Gerald
Picard Kermshlise C.
Picard Shaita C.
Quackenbush John
Schwarzl Thomas
Sultana Razvan
Publication venue: Oxford University Press
Publication date: 01/01/2010

The primary objective of most gene expression studies is the identification of one or more gene signatures: lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb), a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cell gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and lets them download gene signatures in most common gene identifier formats.
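
Comparing a user's gene list against stored signatures, as the abstract describes, could look something like the sketch below. The abstract does not say which similarity measure the portal uses, so Jaccard overlap is swapped in here purely for illustration; the function names, signature names and gene identifiers are hypothetical.

```python
def jaccard(a, b):
    """Jaccard overlap between two gene lists (0.0 when both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_signatures(query, signatures):
    """Rank stored signatures by their overlap with the query gene list."""
    scores = [(name, jaccard(query, genes)) for name, genes in signatures.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)


# Hypothetical identifiers standing in for standardized gene IDs.
query = ["GENE_01", "GENE_02", "GENE_03"]
signatures = {
    "sig_B": ["GENE_07"],
    "sig_A": ["GENE_01", "GENE_02", "GENE_09"],
}
ranked = rank_signatures(query, signatures)
print(ranked)
```

Mapping every signature to one standardized identifier scheme first, as the database does, is what makes a set comparison like this meaningful.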

CiteSeerX

Carolina Digital Repository

CENP-F expression is associated with poor prognosis and chromosomal instability in patients with primary breast cancer

Author: Brennan Donal J.
Culhane Aedín C.
Duffy Michael J.
Fagan Ailís
Fox Edward J.P.
Gallagher William M.
Hegarty Shauna
Higgins Desmond G.
Jirström Karin
Landberg Göran
McCann Amanda H.
Millikan Robert C.
Moyna Siobhan
O'Brien Sallyann L.
Publication date: 01/01/2007

DNA microarrays have the potential to classify tumors according to their transcriptome. Tissue microarrays (TMAs) facilitate the validation of biomarkers by offering a high-throughput approach to sample analysis. We reanalyzed a high-profile breast cancer DNA microarray dataset containing 96 tumor samples using a powerful statistical approach, between-group analysis. Among the genes we identified was centromere protein-F (CENP-F), a gene associated with poor prognosis. In a published follow-up breast cancer DNA microarray study, comprising 295 tumor samples, we found that CENP-F upregulation was significantly associated with worse overall survival (p < 0.001) and reduced metastasis-free survival (p < 0.001). To validate and expand upon these findings, we used 2 independent breast cancer patient cohorts represented on TMAs. CENP-F protein expression was evaluated by immunohistochemistry in 91 primary breast cancer samples from cohort I and 289 samples from cohort II. CENP-F correlated with markers of aggressive tumor behavior including ER negativity and high tumor grade. In cohort I, CENP-F was significantly associated with markers of chromosomal instability (CIN) including cyclin E, increased telomerase activity, c-Myc amplification and aneuploidy. In cohort II, CENP-F correlated with VEGFR2, phosphorylated Ets-2 and Ki67, and in multivariate analysis was an independent predictor of worse breast cancer-specific survival (p = 0.036) and overall survival (p = 0.040). In conclusion, we identified CENP-F as a biomarker associated with poor outcome in breast cancer and showed several novel associations of biological significance.

The Stem Cell Discovery Engine: an integrated repository and analysis system for cancer stem cell comparisons

Author: Aedín C. Culhane
Andrei Krivtsov
Bao
Bard
Barrett
Ben-Porath
Blankenberg
Brad Chapman
Brazma
Culhane
Dick
Dorothy Reilly
Eamonn Maguire
Gabriel M. Altschuler
Gaudet
Goecks
Kappadakunnel
Kimberly Begley
Liberzon
Mick Correll
Oliver Hofmann
Onaitis
Pece
Philippe Rocca-Sera
Pico
Porter
Ramakrishna Sompallae
Ramesh A. Shivdasani
Ray McGovern
Reya
Rocca-Serra
Sandie
Sansone
Scott A. Armstrong
Shannan J. Ho Sui
Sjolund
Susanna-Assunta Sansone
Terah A. A. Hansen
Varnat
Whetzel
Winston Hide
Yang
Publication venue: Oxford University Press
Publication date: 01/01/2012

Mounting evidence suggests that malignant tumors are initiated and maintained by a subpopulation of cancerous cells with biological properties similar to those of normal stem cells. However, descriptions of stem-like gene and pathway signatures in cancers are inconsistent across experimental systems. Driven by a need to improve our understanding of molecular processes that are common and unique across cancer stem cells (CSCs), we have developed the Stem Cell Discovery Engine (SCDE)—an online database of curated CSC experiments coupled to the Galaxy analytical framework. The SCDE allows users to consistently describe, share and compare CSC data at the gene and pathway level. Our initial focus has been on carefully curating tissue and cancer stem cell-related experiments from blood, intestine and brain to create a high quality resource containing 53 public studies and 1098 assays. The experimental information is captured and stored in the multi-omics Investigation/Study/Assay (ISA-Tab) format and can be queried in the data repository. A linked Galaxy framework provides a comprehensive, flexible environment populated with novel tools for gene list comparisons against molecular signatures in GeneSigDB and MSigDB, curated experiments in the SCDE and pathways in WikiPathways. The SCDE is available at http://discovery.hsci.harvard.edu

University of Melbourne Institutional Repository

The CIN4 chromosomal instability qPCR classifier defines tumor aneuploidy and stratifies outcome in grade 2 breast cancer.

Author: A Szabo
Aedín C. Culhane
AJX Lee
Andrew Rowan
András Kiss
Anna-Mária Tőkés
Aron C. Eklund
As Chassevent
Attila Marcell Szász
AV Ivshina
Balázs Győrffy
Borbála Székely
C Lengauer
C Sotiriou
C Swanton
Charles Swanton
CM Perou
E Eisenberg
H Fiegler
H Nakamura
J Toussaint
Janina Kulka
JK Habermann
JS Reis-Filho
K Heselmeyer-Haddad
LW Dalton
M Colleoni
M Gerlinger
Miklós Szendrői
MJ Van de Vijver
NJ Birkbak
Q Li
Qiyuan Li
R Ellsworth
R Roylance
R Tibshirani
RA Irizarry
S Loi
S Paik
SE McClelland
SF Bakhoum
SL Carter
XJ Ma
Zoltán Szállási
Zsófia Sztupinszki
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013

Purpose: Quantifying chromosomal instability (CIN) has both prognostic and predictive clinical utility in breast cancer. In order to establish a robust and clinically applicable gene expression-based measure of CIN, we assessed the ability of four qPCR-quantified genes selected from the 70-gene Chromosomal Instability (CIN70) expression signature to stratify outcome in patients with grade 2 breast cancer. Methods: AURKA, FOXM1, TOP2A and TPX2 (CIN4) were selected from the CIN70 signature due to their high level of correlation with histological grade and mean CIN70 signature expression in silico. We assessed the ability of CIN4 to stratify outcome in an independent cohort of patients diagnosed between 1999 and 2002. 185 formalin-fixed, paraffin-embedded (FFPE) samples were included in the qPCR measurement of CIN4 expression. In parallel, ploidy status of tumors was assessed by flow cytometry. We investigated whether the categorical CIN4 score derived from the CIN4 signature was correlated with recurrence-free survival (RFS) and ploidy status in this cohort. Results: We observed a significant association of tumor proliferation, defined by Ki67 and mitotic index (MI), with both CIN4 expression and aneuploidy. The CIN4 score stratified grade 2 carcinomas into good and poor prognostic cohorts (mean RFS: 83.8 ± 4.9 and 69.4 ± 8.2 months, respectively, p = 0.016) and its predictive power was confirmed by multivariate analysis, outperforming MI and Ki67 expression. Conclusions: CIN4, the first clinically applicable qPCR-derived measure of tumor aneuploidy from FFPE tissue, stratifies grade 2 tumors into good and poor prognosis groups.

Public Library of Science (PLOS)

UCL Discovery

Repository of the Academy's Library

Semmelweis Repository

Online Research Database In Technology

FigShare

TGFBR1 Intralocus Epistatic Interaction as a Risk Factor for Colorectal Cancer

Author: A Castillejo
A de la Chapelle
A Forsti
Adela Castillejo
Aedín C. Culhane
Ana Martinez-Canto
AR Hsieh
B Pasche
C Abadie
C Morcillo-Suarez
Carla Guarinos
Cecilia Egoavil
Cristina Alenda
D Serre
DA Shelly
DB Goldstein
E Castellsague
Enrique Ochoa
Esperanza Irles
Eva Hernandez-Illan
G Suriano
H Yan
HT Lynch
IP Tomlinson
J Tomsic
Javier Lacueva
JC Knight
Jose Luis Soto
K Guda
L Valle
LG Carvajal-Carmona
Maria-Isabel Castillejo
N Segui
O Fletcher
Rafael Calpena
Rafael Lazaro
RY Liao
Silvia Fajardo
Trinidad Mata-Balaguer
Victor Manuel Barbera
Y Bian
Publication venue: Public Library of Science
Publication date: 01/01/2012

In colorectal cancer (CRC), an inherited susceptibility risk affects about 35% of patients, whereas high-penetrance germline mutations account for <6% of cases. A considerable proportion of sporadic tumors could be explained by the coinheritance of multiple low-penetrance variants, some of which are common. We assessed the susceptibility to CRC conferred by genetic variants at the TGFBR1 locus. We analyzed 14 polymorphisms and the allele-specific expression (ASE) of TGFBR1 in 1025 individuals from the Spanish population. A case-control study was undertaken with 504 controls and 521 patients with sporadic CRC. Fourteen polymorphisms located at the TGFBR1 locus were genotyped with the iPLEX Gold (MassARRAY-Sequenom) technology. Descriptive analyses of the polymorphisms and haplotypes and association studies were performed with the SNPator workpackage. No relevant associations were detected between individual polymorphisms or haplotypes and the risk of CRC. The TGFBR1*9A/6A polymorphism was used for the ASE analysis. Heterozygous individuals were analyzed for ASE by fragment analysis using cDNA from normal tissue. The relative level of allelic expression was extrapolated from a standard curve. The cutoff value was calculated with Youden's index. ASE was found in 25.4% of patients and 16.4% of controls. Considering both bimodal and continuous types of distribution, no significant differences between the ASE values of patients and controls were identified. Interestingly, a combined analysis of the polymorphisms and ASE for the association with CRC occurrence revealed that ASE-positive individuals carrying one of the most common haplotypes (H2: 20.7%) showed remarkable susceptibility to CRC (RR: 5.25; 95% CI: 2.547–5.250; p<0.001) with a synergy factor of 3.7. In our study, 54.1% of sporadic CRC cases were attributable to the coinheritance of the H2 haplotype and TGFBR1 ASE. 
These results support the hypothesis that the allelic architecture of cancer genes, rather than individual polymorphisms, more accurately defines the CRC risk
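
The abstract states that the ASE cutoff was calculated with Youden's index, J = sensitivity + specificity - 1. A minimal sketch of that selection is given below; it is not the authors' procedure. It assumes that values at or above the cutoff count as positive, and the function name and toy values are hypothetical.

```python
def youden_cutoff(case_values, control_values):
    """Return the (cutoff, J) pair that maximizes Youden's index.

    Assumption for this sketch: a value >= cutoff is called positive.
    """
    candidates = sorted(set(case_values) | set(control_values))
    best_cutoff, best_j = None, -1.0
    for c in candidates:
        sensitivity = sum(v >= c for v in case_values) / len(case_values)
        specificity = sum(v < c for v in control_values) / len(control_values)
        j = sensitivity + specificity - 1
        if j > best_j:  # keep the first cutoff that reaches the maximum
            best_cutoff, best_j = c, j
    return best_cutoff, best_j


# Hypothetical allelic expression ratios, not data from the study.
cases = [1.5, 1.8, 2.0, 1.1]
controls = [1.0, 1.2, 0.9, 1.3]
cutoff, j = youden_cutoff(cases, controls)
print(cutoff, j)
```

Scanning only the observed values is sufficient, because sensitivity and specificity can change only at those points.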

Repositorio Institucional de la Universidad de Alicante

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

FigShare

Characteristics and outcomes of over 300,000 patients with COVID-19 and history of cancer in the United States and Spain

Author: Ahmed Waheed-Ul-Rahman
Alghoul Heba
Alser Osaid
Alshammari Thamir M.
Aragon Maria
Areia Carlos
Blacketer Clair
Carter William
Casajust Paula
Culhane Aedín C.
Dawoud Dalia
DeFalco Frank
Duarte-Salles Talita
DuVall Scott L.
Falconer Thomas
Fernandez-Bertolin Sergio
Golozar Asieh
Gong Mengchun
Hester Laura
Hripcsak George
Jeon Hokyun
Jonnagaddala Jitendra
Kostka Kristin
Lai Lana Yh
Lynch Kristine E.
Matheny Michael E.
Morales Daniel R.
Natarajan Karthik
Nyberg Fredrik
Ostropolets Anna
Pistillo Andrea
Posada Jose D.
Prats-Uribe Albert
Prieto-Alhambra Daniel
Puente Diana
Recalde Martina
Reich Christian G.
Rivera Donna R.
Roel Elena
Ryan Patrick B.
Schilling Lisa M.
Sena Anthony G.
Shah Karishma
Shah Nigam H.
Shen Yang
Soerjomataram Isabelle
Spotnitz Matthew
Subbian Vignesh
Suchard Marc A.
Tan Eng Hooi
Trama Annalisa
Zhang Lin
Zhang Ying
Publication date: 16/07/2021

Background: We described the demographics, cancer subtypes, comorbidities, and outcomes of patients with a history of cancer and coronavirus disease 2019 (COVID-19). Second, we compared patients hospitalized with COVID-19 to patients diagnosed with COVID-19 and patients hospitalized with influenza. Methods: We conducted a cohort study using eight routinely collected health care databases from Spain and the United States, standardized to the Observational Medical Outcomes Partnership common data model. Three cohorts of patients with a history of cancer were included: (i) diagnosed with COVID-19, (ii) hospitalized with COVID-19, and (iii) hospitalized with influenza in 2017 to 2018. Patients were followed from index date to 30 days or death. We reported demographics, cancer subtypes, comorbidities, and 30-day outcomes. Results: We included 366,050 and 119,597 patients diagnosed and hospitalized with COVID-19, respectively. Prostate and breast cancers were the most frequent cancers (range: 5%–18% and 1%–14% in the diagnosed cohort, respectively). Hematologic malignancies were also frequent, with non-Hodgkin’s lymphoma being among the five most common cancer subtypes in the diagnosed cohort. Overall, patients were aged above 65 years and had multiple comorbidities. Occurrence of death ranged from 2% to 14% and from 6% to 26% in the diagnosed and hospitalized COVID-19 cohorts, respectively. Patients hospitalized with influenza (n = 67,743) had a similar distribution of cancer subtypes, sex, age, and comorbidities but lower occurrence of adverse events. Conclusions: Patients with a history of cancer and COVID-19 had multiple comorbidities and a high occurrence of COVID-19-related events. Hematologic malignancies were frequent. Impact: This study provides epidemiologic characteristics that can inform clinical care and etiologic studies.

ZENODO