17 research outputs found

Improving Software Engineering in Biostatistics: Challenges and Opportunities

Author: Fillinger Sven
Nahnsen Sven
Publication venue: arXiv
Publication date: 24/01/2023

Publikationsserver der Universität Tübingen

Challenges of big data integration in the life sciences

Author: De la Garza Luis
Fillinger Sven
Kohlbacher Oliver
Nahnsen Sven
Peltzer Alexander
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

Big data has been reported to be revolutionizing many areas of life, including science. It summarizes data that is unprecedentedly large, rapidly generated, heterogeneous, and hard to accurately interpret. This availability has also brought new challenges: How to properly annotate data to make it searchable? What are the legal and ethical hurdles when sharing data? How to store data securely, preventing loss and corruption? The life sciences are not the only disciplines that must align themselves with big data requirements to keep up with the latest developments. The large hadron collider, for instance, generates research data at a pace beyond any current biomedical research center. There are three recent major coinciding events that explain the emergence of big data in the context of research: the technological revolution for data generation, the development of tools for data analysis, and a conceptual change towards open science and data. The true potential of big data lies in pattern discovery in large datasets, as well as the formulation of new models and hypotheses. Confirmation of the existence of the Higgs boson, for instance, is one of the most recent triumphs of big data analysis in physics. Digital representations of biological systems have become more comprehensive. This, in combination with advances in machine learning, creates exciting new research possibilities. In this paper, we review the state of big data in bioanalytical research and provide an overview of the guidelines for its proper usage

Publikationsserver der Universität Tübingen

MPG.PuRe

Targeted manipulation of bZIP53 DNA-binding properties influences Arabidopsis metabolism and growth

Author: Chaban Christina
Fillinger Sven
Garg Abhroop
Kirchler Tobias
Ladwig Friederike
Stadelhofer Bettina
Stahl Mark
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019

Publikationsserver der Universität Tübingen

Absence of Non-Canonical, Inhibitory MYD88 Splice Variants in B Cell Lymphomas Correlates With Sustained NF-kappa B Signaling

Author: Admard Jakob
Cardona Gloria Yamel
Dickhöfer Sabine
Fillinger Sven
Nahnsen Sven
Ossowski Stephan
Weber Alexander N. R.
Wolz Olaf-Oliver
Publication venue: Frontiers Media Sa
Publication date: 01/01/2021

Publikationsserver der Universität Tübingen

dRNA-seq Transcriptional Profiling of the FK506 Biosynthetic Gene Cluster in Streptomyces tsukubaensis NRRL18488 and General Analysis of the Transcriptome

Author: Apel Alexander Kristian
Bauer Judith Susanna
Fillinger Sven
Flinspach Katrin
Groß Harald
Jones Adam C.
Nieselt Kay
Publication venue: 'Informa UK Limited'
Publication date: 01/07/2017

Publikationsserver der Universität Tübingen

Improving Software Engineering in Biostatistics: Challenges and Opportunities

Author: Boix Oliver
Boulesteix Anne-Laure
Bové Daniel Sabanés
Fillinger Sven
Gasparini Alessandro
Günhan Burak K.
Jacob Anna E.
Jaki Thomas
Manitz Juliane
Nahnsen Sven
Schüler Armin
Seibold Heidi
Publication venue: arXiv
Publication date: 01/01/2023

Programming is ubiquitous in applied biostatistics; adopting software engineering skills will help biostatisticians do a better job. To explain this, we start by highlighting key challenges for software development and application in biostatistics. Silos between different statistician roles, projects, departments, and organizations lead to the development of duplicate and suboptimal code. Building on top of open-source software requires critical appraisal and risk-based assessment of the used modules. Code that is written needs to be readable to ensure reliable software. The software needs to be easily understandable for the user, as well as developed within testing frameworks to ensure that long term maintenance of the software is feasible. Finally, the reproducibility of research results is hindered by manual analysis workflows and uncontrolled code development. We next describe how the awareness of the importance and application of good software engineering practices and strategies can help address these challenges. The foundation is a better education in basic software engineering skills in schools, universities, and during the work life. Dedicated software engineering teams within academic institutions and companies can be a key factor for the establishment of good software engineering practices and catalyze improvements across research projects. Providing attractive career paths is important for the retainment of talents. Readily available tools can improve the reproducibility of statistical analyses and their use can be exercised in community events. [...

Juelich Shared Electronic Resources

A data management infrastructure for the integration of imaging and omics data in life sciences

Author: Bitzer Michael
De la Garza Luis
Fillinger Sven
Friedrich Andreas
Gabernet Gisela
Harter Klaus
Horger Marius Stefan
Koch Tobias
Kuhn Cuellar Luis Eugenio
Malek Nisar Peter
Nahnsen Sven
Richter Sandra
Seyboldt Adrian
Thaiss Wolfgang Maximilian
Wanke Friederike
Zur Oven-Krockhaus Sven
Publication venue: Bmc
Publication date: 01/01/2022

BACKGROUND: As technical developments in omics and biomedical imaging increase the throughput of data generation in life sciences, the need for information systems capable of managing heterogeneous digital assets is increasing. In particular, systems supporting the findability, accessibility, interoperability, and reusability (FAIR) principles of scientific data management. RESULTS: We propose a Service Oriented Architecture approach for integrated management and analysis of multi-omics and biomedical imaging data. Our architecture introduces an image management system into a FAIR-supporting, web-based platform for omics data management. Interoperable metadata models and middleware components implement the required data management operations. The resulting architecture allows for FAIR management of omics and imaging data, facilitating metadata queries from software applications. The applicability of the proposed architecture is demonstrated using two technical proofs of concept and a use case, aimed at molecular plant biology and clinical liver cancer research, which integrate various imaging and omics modalities. CONCLUSIONS: We describe a data management architecture for integrated, FAIR-supporting management of omics and biomedical imaging data, and exemplify its applicability for basic biology research and clinical studies. We anticipate that FAIR data management systems for multi-modal data repositories will play a pivotal role in data-driven research, including studies which leverage advanced machine learning methods, as the joint analysis of omics and imaging data, in conjunction with phenotypic metadata, becomes not only desirable but necessary to derive novel insights into biological processes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04584-3

Publikationsserver der Universität Tübingen

PubMed Central

dRNA-seq transcriptional profiling of the FK506 biosynthetic gene cluster in Streptomyces tsukubaensis

Author: Adam C. Jones
Alexander Herbig
Alexander K. Apel
Bailey TL
Cynthia Sharma
Harald Gross
Judith S. Bauer
Katrin Flinspach
Kay Nieselt
Konrad Förstner
Maddess ML
Sven Fillinger
Wallemacq PE
Publication venue: 'Informa UK Limited'

Crossref

dRNA-seq transcriptional profiling of the FK506 biosynthetic gene cluster in Streptomyces tsukubaensis NRRL18488 and general analysis of the transcriptome

Author: Adam C. Jones (432837)
Alexander Herbig (413039)
Alexander K. Apel (531105)
Cynthia Sharma (544837)
Harald Gross (571760)
Judith S. Bauer (571759)
Katrin Flinspach (531102)
Kay Nieselt (27239)
Konrad Förstner (822551)
Sven Fillinger (4216972)

FK506 (tacrolimus) is a valuable immunosuppressant produced by several Streptomyces strains. In the genome of the wild type producer Streptomyces tsukubaensis NRRL18488, FK506 biosynthesis is encoded by a gene cluster that spans 83.5 kb. A whole transcriptome differential shotgun sequencing (dRNA-seq) of S. tsukubaensis was performed to analyze transcription at 2 different time points: before and during active FK506 production. In total, 8,914 transcription start sites were identified in either condition, which enabled precise determination of the 5′-UTR length of the corresponding transcripts as well as the identification of 2 consensus sequence motifs in the promoter regions. The transcription start sites of all gene operons within the FK506 cluster were identified, including 3 examples of leaderless RNA transcripts. These data provide detailed insight into the transcription of the FK506 biosynthetic gene cluster to support future regulatory studies, genetic manipulation, and industrial production.

FigShare