120 research outputs found

DISCO-SCA and properly applied GSVD as swinging methods to find common and distinctive processes

Author: de Lathauwer L.
de Moor B.
Kiers H.A.L.
Schouteden M.
Smilde A.K.
Thorrez L.
van der Werf M.J.
van Deun K.
van Mechelen I.
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2012

International Migration, Integration and Social Cohesion online publications

Evaluation of O2PLS in Omics data integration

Author: D Rubin
EF Lock
G Box
Geurt Jongbloed
H Liu
H Wold
Hae-Won Uh
I González
J Trygg
Jeanine Houwing-Duistermaat
K Lê Cao
M Bylesjö
M Inouye
M Schouteden
Markus Perola
ME Tipping
Perttu Salo
R Core Team
R Tibshirani
R Wehrens
Said el Bouhaddani
T Löfstedt
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Background: Rapid computational and technological developments have made large amounts of omics data available at different biological levels. It is becoming clear that simultaneous data analysis methods are needed for better interpretation and understanding of the underlying systems biology. Different methods have been proposed for this task, among them Partial Least Squares (PLS) related methods. To also deal with orthogonal variation, that is, systematic variation in one data set that is unrelated to the other, we consider Two-way Orthogonal PLS (O2PLS): an integrative data analysis method that is capable of modeling systematic variation while providing more parsimonious models that aid interpretation.

Results: A simulation study to assess the performance of O2PLS showed positive results in both low and higher dimensions. More noise (50% of the data) only affected the systematic part estimates. A data analysis was conducted using metabolomics and transcriptomics data from a large Finnish cohort (DILGOM). A previous sequential study, using the same data, showed significant correlations between the Lipo-Leukocyte (LL) module and lipoprotein metabolites. The O2PLS results were in agreement with these findings, identifying almost the same set of co-varying variables. Moreover, our integrative approach identified other associated genes and metabolites, while taking into account systematic variation in the data. Including orthogonal components enhanced overall fit, but the orthogonal variation was difficult to interpret.

Conclusions: Simulations showed that the O2PLS estimates were close to the true parameters in both low and higher dimensions. In the presence of more noise (50%), the orthogonal part estimates could not distinguish well between joint and unique variation. The joint estimates were not systematically affected. Simultaneous analysis with O2PLS on metabolome and transcriptome data showed that the LL module, together with VLDL and HDL metabolites, was important for the relation between the metabolome and the transcriptome. This is in agreement with an earlier study. In addition, more genes and metabolites were identified as important for the joint covariation.
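To make the distinction between joint and orthogonal variation concrete, the sketch below simulates two noise-free data blocks, each built from one joint component plus one block-specific (orthogonal) component, and takes the SVD of their cross-product, the usual starting point of PLS-type methods. This is only an illustration under a simplified data model, not the O2PLS algorithm evaluated in the paper; the dimensions, component strengths, and all variable names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p, q = 50, 6, 4  # samples, X variables, Y variables (illustrative sizes)

# Three mutually orthogonal, zero-mean, unit-length score vectors:
# one joint, one specific to X, one specific to Y.
Z = rng.standard_normal((n, 3))
S, _ = np.linalg.qr(Z - Z.mean(axis=0))
t_joint, t_x_only, u_y_only = S[:, 0], S[:, 1], S[:, 2]

# Unit-length joint loadings and arbitrary block-specific loadings.
w_true = rng.standard_normal(p)
w_true /= np.linalg.norm(w_true)
c_true = rng.standard_normal(q)
c_true /= np.linalg.norm(c_true)
p_orth = rng.standard_normal(p)
q_orth = rng.standard_normal(q)

# Each block = joint part + part unrelated to the other block (no noise).
X = 3 * np.outer(t_joint, w_true) + 5 * np.outer(t_x_only, p_orth)
Y = 2 * np.outer(t_joint, c_true) + 4 * np.outer(u_y_only, q_orth)

# SVD of the cross-product: only variation shared by X and Y contributes.
U, s, Vt = np.linalg.svd(X.T @ Y, full_matrices=False)
w_hat, c_hat = U[:, 0], Vt[0]

# Alignment (absolute cosine) between estimated and true joint loadings.
print(abs(w_hat @ w_true), abs(c_hat @ c_true))
```

Because the block-specific scores are exactly orthogonal to everything in the other block here, the cross-product has a single nonzero singular value and the joint loadings are recovered up to sign. With added noise this separation is no longer exact.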

Crossref

TU Delft Repository

Springer - Publisher Connector

Julkari

PubMed Central

Leiden University Scholarly Publications

White Rose Research Online

DISCO-SCA and Properly Applied GSVD as Swinging Methods to Find Common and Distinctive Processes

Author: A Subramanian
A Tanay
Age K. Smilde
AK Smilde
Anna Tramontano
Bart De Moor
C Hennig
CC Paige
CF Van Loan
HAL Kiers
Henk A. L. Kiers
I Måge
IT Jolliffe
Iven Van Mechelen
J Ihmels
J Westerhuis
JA Hageman
JM Stuart
K Devarajan
K Lemmens
K Van Deun
KA Bernstein
Katrijn Van Deun
Lieven De Lathauwer
Lieven Thorrez
M Schouteden
Mariët J. van der Werf
Martijn Schouteden
ME Timmerman
MJ van der Werf
MW Browne
NS Holter
O Alter
P Howland
P Tamayo
RA van den Berg
S Bergmann
S Friedland
SP Ponnapalli
T Dahl
T Löfstedt
U Lorenzo-Seva
VK Mootha
Z Bai
Publication venue: Public Library of Science
Publication date: 01/01/2012

BACKGROUND: In systems biology it is common to obtain information from multiple sources for the same set of biological entities. Examples include expression data for the same set of orthologous genes screened in different organisms and data on the same set of culture samples obtained with different high-throughput techniques. A major challenge is to find the important biological processes underlying the data and to disentangle processes common to all data sources from processes distinctive for a specific source. Recently, two promising simultaneous data integration methods have been proposed to attain this goal, namely generalized singular value decomposition (GSVD) and simultaneous component analysis with rotation to common and distinctive components (DISCO-SCA).

RESULTS: Both theoretical analyses and applications to biologically relevant data show that: (1) straightforward applications of GSVD yield unsatisfactory results, (2) DISCO-SCA performs well, (3) provided proper pre-processing and algorithmic adaptations, GSVD reaches a performance level similar to that of DISCO-SCA, and (4) DISCO-SCA is directly generalizable to more than two data sources. The biological relevance of DISCO-SCA is illustrated with two applications. First, in a setting of comparative genomics, it is shown that DISCO-SCA recovers a common theme of cell cycle progression and a yeast-specific response to pheromones. The biological annotation was obtained by applying Gene Set Enrichment Analysis in an appropriate way. Second, in an application of DISCO-SCA to metabolomics data for Escherichia coli obtained with two different chemical analysis platforms, it is illustrated that the metabolites involved in some of the biological processes underlying the data are detected by only one of the two platforms; therefore, platforms for microbial metabolomics should be tailored to the biological question.

CONCLUSIONS: Both DISCO-SCA and properly applied GSVD are promising integrative methods for finding common and distinctive processes in multisource data. Open source code for both methods is provided.
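The idea of common versus distinctive components can be sketched with a plain simultaneous component analysis: concatenate two blocks that share the same rows, take an SVD, and check how much of each component's variation falls in each block. The code below is a simplified illustration only. It omits the rotation step that defines DISCO-SCA and is not the open source code mentioned in the abstract; the function name `sca_block_shares`, the block-scaling choice, and the synthetic data are assumptions.

```python
import numpy as np

def sca_block_shares(X1, X2, n_components):
    """SVD-based simultaneous component analysis of two blocks sharing rows.

    Returns the component scores and, per component, the fraction of its
    sum of squares that falls in block 1 (near 1 or 0: distinctive for
    block 1 or block 2; intermediate: common to both blocks).
    """
    # Center columns and scale each block to unit total sum of squares
    # so that neither block dominates the decomposition.
    X1 = X1 - X1.mean(axis=0)
    X2 = X2 - X2.mean(axis=0)
    X1 = X1 / np.linalg.norm(X1)
    X2 = X2 / np.linalg.norm(X2)

    U, s, Vt = np.linalg.svd(np.hstack([X1, X2]), full_matrices=False)
    scores = U[:, :n_components]
    loadings = Vt[:n_components].T * s[:n_components]

    # Split the loadings by block and compare their sums of squares.
    ss1 = (loadings[: X1.shape[1]] ** 2).sum(axis=0)
    ss2 = (loadings[X1.shape[1]:] ** 2).sum(axis=0)
    return scores, ss1 / (ss1 + ss2)

# Synthetic example: one common process and one distinctive process per block.
rng = np.random.default_rng(0)
Z = rng.standard_normal((30, 3))
S, _ = np.linalg.qr(Z - Z.mean(axis=0))  # orthonormal, zero-mean scores
common, only1, only2 = S.T

X1 = np.outer(common, [2, 2, 0, 0]) + np.outer(only1, [0, 0, 1, 1])
X2 = np.outer(common, [2, 2, 0, 0, 0]) + np.outer(only2, [0, 0, 3, 0, 0])

scores, share1 = sca_block_shares(X1, X2, n_components=3)
print(np.round(share1, 2))  # block-1 share of each component
```

In this constructed example the components separate cleanly without rotation because the underlying processes were built to be orthogonal. On real data the unrotated components generally mix common and distinctive information, which is the problem the rotation in DISCO-SCA is meant to address.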

University of Groningen

Directory of Open Access Journals

UvA-DARE

International Migration, Integration and Social Cohesion online publications

FigShare

Public Library of Science (PLOS)

Crossref

Proceedings - University of Groningen

ARTS repository - University of Groningen

PubMed Central

Dissertations of the University of Groningen