Search CORE

4 research outputs found

Interoperable and scalable data analysis with microservices: applications in metabolomics.

Author: Bergmann S.
Burman J.
Capuccini M.
Carone M.
Cascante M.
de Atauri P.
Emami Khoonsari P.
Foguet C.
Gonzalez-Beltran A.N.
Hankemeier T.
Haug K.
He S.
Herman S.
Johnson D.
Kale N.
Kultima K.
Larsson A.
Moreno P.
Neumann S.
Peters K.
Pireddu L.
Rocca-Serra P.
Roger P.
Rueedi R.
Ruttkies C.
Sadawi N.
Salek R.M.
Sansone S.A.
Schober D.
Selivanov V.
Spjuth O.
Steinbeck C.
Thévenot E.A.
van Vliet M.
Zanetti G.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019
Field of study

Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator. We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis. The environment can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry, one nuclear magnetic resonance spectroscopy and one fluxomics study. We showed that the method scales dynamically with increasing availability of computational resources. We demonstrated that the method facilitates interoperability using integration of the major software suites resulting in a turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, statistics and identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science. The PhenoMeNal consortium maintains a web portal (https://portal.phenomenal-h2020.eu) providing a GUI for launching the Virtual Research Environment. The GitHub repository https://github.com/phnmnl/ hosts the source code of all projects. Supplementary data are available at Bioinformatics online

Serveur académique lausannois

Publikationer från Uppsala Universitet

Oxford University Research Archive

Leiden University Scholary Publications

Digitala Vetenskapliga Arkivet - Academic Archive On-line

HAL-CEA

Diposit Digital de la Universitat de Barcelona

The machine learning life cycle and the cloud: implications for drug discovery

Author: Ahmed L
Batool M
Biology MV
Chan HCS
Chandrasekaran SN
Dalpé G
Di Tommaso P
Dreiman GHS
Eisenstein M
Emami Khoonsari P
Gudivada V
Kensert A
Keshavarzi Arshadi A
Kim H
Lapins M
Ma’ayan A
Moghadam BT
Mok NY
Nayarisseri A
Novella JA
Peters K
Sculley D
Sheller MJ
Sobeslav V
Stokes JM
Svensson F
Toor S
Valerio LG
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

PhenoMeNal: processing and analysis of metabolomics data in the cloud

Author: Bergmann S
Bradbury J
Capuccini M
Cascante M
De Atauri P
Ebbels T
Emami Khoonsari P
Foguet C
Glen R
Gonzalez-Beltran A
Guenther U
Handakas E
Hankemeier T
Haug K
Herman S
Holub P
Izzo M
Jacob D
Johnson D
Jourdan F
Kale N
Karaman I
Khalili B
Kultima K
Lampa S
Larsson A
Ludwig C
Moreno P
Neumann S
Novella JA
O'Donovan C
Pearce JTM
Peluso A
Peters K
Piras ME
Pireddu L
Reed MAC
Rocca-Serra P
Roger P
Rosato A
Rueedi R
Ruttkies C
Sadawi N
Salek R
Sansone S-A
Schober D
Selivanov V
Spjuth O
Steinbeck C
Thévenot E
Tomasoni M
Van Rijswijk M
Van Vliet M
Viant M
Weber R
Zanetti G
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018
Field of study

Background: Metabolomics is the comprehensive study of a multitude of small molecules to gain insight into an organism's metabolism. The research field is dynamic and expanding with applications across biomedical, biotechnological and many other applied biological domains. Its computationally-intensive nature has driven requirements for open data formats, data repositories and data analysis tools. However, the rapid progress has resulted in a mosaic of independent, and sometimes incompatible, analysis methods that are difficult to connect into a useful and complete data analysis solution. Findings: The PhenoMeNal (Phenome and Metabolome aNalysis) e-infrastructure provides a complete, workflow-oriented, interoperable metabolomics data analysis solution for a modern infrastructure-as-a-service (IaaS) cloud platform. PhenoMeNal seamlessly integrates a wide array of existing open source tools which are tested and packaged as Docker containers through the project's continuous integration process and deployed based on a kubernetes orchestration framework. It also provides a number of standardized, automated and published analysis workflows in the user interfaces Galaxy, Jupyter, Luigi and Pachyderm. Conclusions: PhenoMeNal constitutes a keystone solution in cloud infrastructures available for metabolomics. It provides scientists with a ready-to-use, workflow-driven, reproducible and shareable data analysis platform harmonizing the software installation and configuration through user-friendly web interfaces. The deployed cloud environments can be dynamically scaled to enable large-scale analyses which are interfaced through standard data formats, versioned, and have been tested for reproducibility and interoperability. The flexible implementation of PhenoMeNal allows easy adaptation of the infrastructure to other application areas and 'omics research domains

Oxford University Research Archive

Analysis of the Cerebrospinal Fluid Proteome in Alzheimer's Disease

Author: A Haggmark
A Zhou
AH America
AH Simonsen
Anna Häggmark
AW Henkel
AW Stoker
B Kobe
B Ma
B Stevens
B Worley
CE Teunissen
CF Hwang
CO Arregui
CP Ferri
CW Wu
D Shteynberg
D Van Vactor
DC Chamrad
E Masliah
F Abdi
F Song
G Shevchenko
Ganna Shevchenko
GK Smyth
GK Smyth
GK Smyth
GN Yin
H Jahn
H Tumani
H Weisser
J Ai
J Cox
J Cox
J Quackenbush
J Zhang
JB Coble
JB Toledo
JD Andersen
JH Kang
JL Bixby
Jonas Bergquist
K Blennow
K Blennow
K Kultima
Kim Kultima
Kristel Sleegers
L McHugh
L Zheng
LA Echan
Lars Lannfelt
LE Donovan
Lena Kilander
LG Johnsen
LM Boulanger
LY Geer
M Brosch
M Puchades
M Sturm
M Sturm
Maria Lönnberg
Maria Mikus
Martin Ingelsson
MJ Garton
N Colaert
N Mattsson
N Takahashi
O Hansson
P Perez-Pinera
P Plomgaard
P Podlesniy
Payam Emami Khoonsari
Peter Nilsson
R Craig
R Günther
R Kairouz
R Luo
R Moulder
R Timpl
RD Terry
S Banerjee
S Beranova-Giorgianni
S Calza
S Cappadona
S Chen
S de Vega
S Musunuri
S Nahnsen
S Xu
SF Hansson
SK Kwon
VP Andreev
W Lin
Y Han
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref