4 research outputs found

    MetaboCloud : A catalog of microservices hosted on a Cloud infrastructure and addressing issues linked to FAIR principles and open science

    No full text
    International audienceMetabolomics, the study of small molecules called metabolites, is a field generating massive and complex data that needs to be processed and interpreted. However, this requires overcoming several challenges, such as data manipulation, where the heterogeneity of technologies makes it difficult to standardize the methods and tools, as well as molecule annotation which is still a major bottleneck nowadays. Bioinformatics therefore plays an important role in answering these issues. However, it also brings its own set of questions, in terms of interoperability and reproducibility for example.In this context, new resources must be put in place to address the needs related to these questions. Consideration about FAIR (Findable, Accessible, Interoperable, Reusable) principles and open science has led to the initiation of MetaboCloud; a collaborative project between teams from the “Exploration of Metabolism Platform” (PFEM) member of the MetaboHUB infrastructure, the Auvergne Bioinformatics (AuBi) platform and the Mesocentre from the Clermont Auvergne Univercity (UCA). It aims to provide a set of bioinformatics tools, in metabolomics as a start, in the form of microservices, hosted on a Cloud infrastructure. It also intends to serve as a proof of concept, and to create a recipe to share with the bioinformatics community, integrating best practices in terms of code and deployment, thus ensuring high service quality and easy maintainability. The MetaboCloud microservices infrastructure is based on (1) bioinformatics tools, (2) from scratch API development if tools are not available, (3) an advanced CI/CD work environment managing the construction of a docker image, (4) taken all together in an OpenStack cloud technology environment. A roadmap containing about ten microservices has been drawn up for this project. Three of them are currently open to the community for use. Two have been developed using the Java language and the SpringBoot web framework. One is based on the CDK tool, which offers several functionalities using structural information (InChI, MOL or SDF) as input data. Firstly, it can return chemical properties of a compound, such as masses, SMILES, InChIs, the logP and the formula. It can also convert the compound’s structural information into another format such as InChI Key, InChI, MOL or SDF. Finally, it can depict a molecule, which means returning its PNG or SVG image. The other microservice is based on the InChI tool, which can, using the same type of input that CDK, generate the InChI and the InChI Key of a compound. The third microservice, derived from the Goslin tool, has been implemented in Python using the Falcon web framework. It can be used to transform a common lipid name into a standardized one. All microservices methods are described in OpenAPI standardized format, which enable anyone to generate code to query them in a large panel of programming languages. Each microservice has its own Docker container.These microservices can be used in different ways. They can be integrated into an application, or they can be used on their own or combined with other microservices inside a script. Furthermore, web components which are a collection of functionalities establishing a standardized component model for the web, enabling the encapsulation and interoperability of individual HTML elements, also developed within PFEM, are available for use as clients to query each of the microservices. These web components are available in a npm library.The development of bioinformatics tools in the form of microservices therefore offers a number of advantages. In particular, from an interoperability point of view, as they can be queried from any programming language. It also addresses issues of reproducibility, since the versioning of a microservice is controlled by its containerization. Moreover, a web portal, referenced in bio.tools, has been created to make all developed applications accessible, associated with their documentation and metadata, thus addressing the "Accessible" dimension of the FAIR principles

    FAIRing Research Data to Live @AuBi Platform

    No full text
    International audienceHosted by the Mésocentre Clermont-Auvergne [1], the Auvergne Bioinformatique (AuBi) platform is member of the Institut Français de Bioinformatique (IFB) [2]. Around 15 labs contributes to AuBi activities with scientific questions related to health, microbes, agronomy and environment disciplines. Despite a high diversity a research questioning, data that are produced, processed and analyzed belongs to four main categories : sequences, images, qualitative and quantitative data and digital models. We aim at sharing expertise and knowledge in handling these kind of large-scale data with computing and storage facilities administrated at Mesocentre.An update of general services are presented here in terms of computing and storage as well as our prospects in the coming months. Recently opened services like jupyter notebooks and gitlab server for code tracking are keys points towards reproducible sciences. In addition, our concern about security and safety data storage guided us for proposing a new storage services accessible from any labs on our campus. Among services administrated for biologists and bioinformaticians are mature tools, an OMERO [3] server for data visualization and (meta)-data storage as well as the bio-analysis platform Galaxy [4]. In addition to the administration and technical support, we are contributing to scientific projects, such as the microservice web portal for the PlateForme d’Exploration du métabolisme. The project started a year ago and we are offering an infrastructure and our expertise in tool development to improve metabolomic annotation and analysis from a cloud infrastructure. In addition, through the IFB, we interact with FAIRChecker [5] development team to improve the FAIRness of our microservices web services. A poster to specifically detail the project has been submitted.Considering the data lifecycle, we are collaborating on 3 main IFB project. Firstly, one project initiated with the Projet-flash ANR machine-actionable DMP for Life Sciences (maDMP4LS) and the Institut de l’lnformation Scientifique et Technique (INIST) team ended up into the produce a web form connected to OPIDoR DMP, to help users to estimate project cost on the Mesocentre. Secondly, as part of the MuDiS4LS project, Data Stewart Wizard project was used to develop DMP dedicated to platforms: images produced and annotated on the CLermont Imagerie Confocal platform [6] as well as images stored and analyzed on omero.mesocentre.uca.fr. Thirdly, we contributed to the previously OpenLink project renamed as MadBot [7] offering a web portal orchestrating project information from data production to data publication with a view and link to where data are stored (Galaxy, OMERO, ...)To support users towards open science, FAIR principles and reproducibility, we are working together with the “Open Science Unit” and the IFB training team, investing in training sessions. Training are oriented toward best practices in using the Mesocenter infrastructures as well as raising awareness about the proper use of metadata while storing image data. We are also offering training sessions for bioinformaticians for best practices (FAIR-bioinfo) and for biologists with the Galaxy Training Network with topics of interest for the AuBi community.All our actions are Open Science oriented. This is challenging as actions have to be coordinated and in agreement with the University and research Institutes strategies, the Information Systems Management and the GDPR regulations, and considering diversity of research programs and various types of produced and stored data.AcknowledgementsINIST, maDMP4LS, IS1-MuDiS4LS and MadBot teamsReferences1.www.mesocentre.uca.fr2.www.france-bioinformatique.fr3.http://www.openmicroscopy.org/4.Afgan et al. (2018) https://doi.org/10.1093/nar/gky3795.Gaignard et al. (2023) https://doi.org/10.1186/s13326-023-00289-56.https://ressources.france-bioinformatique.fr/fr/plateformes/aubi7.https://www.france-bioinformatique.fr/science-ouverte

    FAIRing Research Data to Live @AuBi Platform

    No full text
    International audienceHosted by the Mésocentre Clermont-Auvergne [1], the Auvergne Bioinformatique (AuBi) platform is member of the Institut Français de Bioinformatique (IFB) [2]. Around 15 labs contributes to AuBi activities with scientific questions related to health, microbes, agronomy and environment disciplines. Despite a high diversity a research questioning, data that are produced, processed and analyzed belongs to four main categories : sequences, images, qualitative and quantitative data and digital models. We aim at sharing expertise and knowledge in handling these kind of large-scale data with computing and storage facilities administrated at Mesocentre.An update of general services are presented here in terms of computing and storage as well as our prospects in the coming months. Recently opened services like jupyter notebooks and gitlab server for code tracking are keys points towards reproducible sciences. In addition, our concern about security and safety data storage guided us for proposing a new storage services accessible from any labs on our campus. Among services administrated for biologists and bioinformaticians are mature tools, an OMERO [3] server for data visualization and (meta)-data storage as well as the bio-analysis platform Galaxy [4]. In addition to the administration and technical support, we are contributing to scientific projects, such as the microservice web portal for the PlateForme d’Exploration du métabolisme. The project started a year ago and we are offering an infrastructure and our expertise in tool development to improve metabolomic annotation and analysis from a cloud infrastructure. In addition, through the IFB, we interact with FAIRChecker [5] development team to improve the FAIRness of our microservices web services. A poster to specifically detail the project has been submitted.Considering the data lifecycle, we are collaborating on 3 main IFB project. Firstly, one project initiated with the Projet-flash ANR machine-actionable DMP for Life Sciences (maDMP4LS) and the Institut de l’lnformation Scientifique et Technique (INIST) team ended up into the produce a web form connected to OPIDoR DMP, to help users to estimate project cost on the Mesocentre. Secondly, as part of the MuDiS4LS project, Data Stewart Wizard project was used to develop DMP dedicated to platforms: images produced and annotated on the CLermont Imagerie Confocal platform [6] as well as images stored and analyzed on omero.mesocentre.uca.fr. Thirdly, we contributed to the previously OpenLink project renamed as MadBot [7] offering a web portal orchestrating project information from data production to data publication with a view and link to where data are stored (Galaxy, OMERO, ...)To support users towards open science, FAIR principles and reproducibility, we are working together with the “Open Science Unit” and the IFB training team, investing in training sessions. Training are oriented toward best practices in using the Mesocenter infrastructures as well as raising awareness about the proper use of metadata while storing image data. We are also offering training sessions for bioinformaticians for best practices (FAIR-bioinfo) and for biologists with the Galaxy Training Network with topics of interest for the AuBi community.All our actions are Open Science oriented. This is challenging as actions have to be coordinated and in agreement with the University and research Institutes strategies, the Information Systems Management and the GDPR regulations, and considering diversity of research programs and various types of produced and stored data.AcknowledgementsINIST, maDMP4LS, IS1-MuDiS4LS and MadBot teamsReferences1.www.mesocentre.uca.fr2.www.france-bioinformatique.fr3.http://www.openmicroscopy.org/4.Afgan et al. (2018) https://doi.org/10.1093/nar/gky3795.Gaignard et al. (2023) https://doi.org/10.1186/s13326-023-00289-56.https://ressources.france-bioinformatique.fr/fr/plateformes/aubi7.https://www.france-bioinformatique.fr/science-ouverte

    FAIRing Research Data to Live @AuBi Platform

    No full text
    International audienceHosted by the Mésocentre Clermont-Auvergne [1], the Auvergne Bioinformatique (AuBi) platform is member of the Institut Français de Bioinformatique (IFB) [2]. Around 15 labs contributes to AuBi activities with scientific questions related to health, microbes, agronomy and environment disciplines. Despite a high diversity a research questioning, data that are produced, processed and analyzed belongs to four main categories : sequences, images, qualitative and quantitative data and digital models. We aim at sharing expertise and knowledge in handling these kind of large-scale data with computing and storage facilities administrated at Mesocentre.An update of general services are presented here in terms of computing and storage as well as our prospects in the coming months. Recently opened services like jupyter notebooks and gitlab server for code tracking are keys points towards reproducible sciences. In addition, our concern about security and safety data storage guided us for proposing a new storage services accessible from any labs on our campus. Among services administrated for biologists and bioinformaticians are mature tools, an OMERO [3] server for data visualization and (meta)-data storage as well as the bio-analysis platform Galaxy [4]. In addition to the administration and technical support, we are contributing to scientific projects, such as the microservice web portal for the PlateForme d’Exploration du métabolisme. The project started a year ago and we are offering an infrastructure and our expertise in tool development to improve metabolomic annotation and analysis from a cloud infrastructure. In addition, through the IFB, we interact with FAIRChecker [5] development team to improve the FAIRness of our microservices web services. A poster to specifically detail the project has been submitted.Considering the data lifecycle, we are collaborating on 3 main IFB project. Firstly, one project initiated with the Projet-flash ANR machine-actionable DMP for Life Sciences (maDMP4LS) and the Institut de l’lnformation Scientifique et Technique (INIST) team ended up into the produce a web form connected to OPIDoR DMP, to help users to estimate project cost on the Mesocentre. Secondly, as part of the MuDiS4LS project, Data Stewart Wizard project was used to develop DMP dedicated to platforms: images produced and annotated on the CLermont Imagerie Confocal platform [6] as well as images stored and analyzed on omero.mesocentre.uca.fr. Thirdly, we contributed to the previously OpenLink project renamed as MadBot [7] offering a web portal orchestrating project information from data production to data publication with a view and link to where data are stored (Galaxy, OMERO, ...)To support users towards open science, FAIR principles and reproducibility, we are working together with the “Open Science Unit” and the IFB training team, investing in training sessions. Training are oriented toward best practices in using the Mesocenter infrastructures as well as raising awareness about the proper use of metadata while storing image data. We are also offering training sessions for bioinformaticians for best practices (FAIR-bioinfo) and for biologists with the Galaxy Training Network with topics of interest for the AuBi community.All our actions are Open Science oriented. This is challenging as actions have to be coordinated and in agreement with the University and research Institutes strategies, the Information Systems Management and the GDPR regulations, and considering diversity of research programs and various types of produced and stored data.AcknowledgementsINIST, maDMP4LS, IS1-MuDiS4LS and MadBot teamsReferences1.www.mesocentre.uca.fr2.www.france-bioinformatique.fr3.http://www.openmicroscopy.org/4.Afgan et al. (2018) https://doi.org/10.1093/nar/gky3795.Gaignard et al. (2023) https://doi.org/10.1186/s13326-023-00289-56.https://ressources.france-bioinformatique.fr/fr/plateformes/aubi7.https://www.france-bioinformatique.fr/science-ouverte
    corecore