Search CORE

23 research outputs found

Generalized Multimodal ELBO

Author: Daunhawer Imant
Sutter Thomas M.
Vogt Julia E.
Publication venue
Publication date: 04/05/2021
Field of study

Multiple data types naturally co-occur when describing real-world phenomena and learning from them is a long-standing goal in machine learning research. However, existing self-supervised generative models approximating an ELBO are not able to fulfill all desired requirements of multimodal models: their posterior approximation functions lead to a trade-off between the semantic coherence and the ability to learn the joint data distribution. We propose a new, generalized ELBO formulation for multimodal data that overcomes these limitations. The new objective encompasses two previous methods as special cases and combines their benefits without compromises. In extensive experiments, we demonstrate the advantage of the proposed method compared to state-of-the-art models in self-supervised, generative learning tasks.Comment: 2021 ICL

arXiv.org e-Print Archive

Repository for Publications and Research Data

Multimodal Generative Learning Utilizing Jensen-Shannon-Divergence

Author: Daunhawer Imant
Sutter Thomas M.
Vogt Julia E.
Publication venue
Publication date: 02/11/2020
Field of study

Learning from different data types is a long-standing goal in machine learning research, as multiple information sources co-occur when describing natural phenomena. However, existing generative models that approximate a multimodal ELBO rely on difficult or inefficient training schemes to learn a joint distribution and the dependencies between modalities. In this work, we propose a novel, efficient objective function that utilizes the Jensen-Shannon divergence for multiple distributions. It simultaneously approximates the unimodal and joint multimodal posteriors directly via a dynamic prior. In addition, we theoretically prove that the new multimodal JS-divergence (mmJSD) objective optimizes an ELBO. In extensive experiments, we demonstrate the advantage of the proposed mmJSD model compared to previous work in unsupervised, generative learning tasks.Comment: Accepted at NeurIPS 2020, camera-ready versio

arXiv.org e-Print Archive

Decoupling State Representation Methods from Reinforcement Learning in Car Racing

Author: Daunhawer Imant
Montoya Juan M.
Vogt Julia E.
Wiering Marco
Publication venue: 'Scitepress'
Publication date: 01/01/2021
Field of study

In the quest for efficient and robust learning methods, combining unsupervised state representation learning and reinforcement learning (RL) could offer advantages for scaling RL algorithms by providing the models with a useful inductive bias. For achieving this, an encoder is trained in an unsupervised manner with two state representation methods, a variational autoencoder and a contrastive estimator. The learned features are then fed to the actor-critic RL algorithm Proximal Policy Optimization (PPO) to learn a policy for playing Open AI's car racing environment. Hence, such procedure permits to decouple state representations from RL-controllers. For the integration of RL with unsupervised learning, we explore various designs for variational autoencoders and contrastive learning. The proposed method is compared to a deep network trained directly on pixel inputs with PPO. The results show that the proposed method performs slightly worse than directly learning from pixel inputs; however, it has a more stable learning curve, a substantial reduction of the buffer size, and requires optimizing 88% fewer parameters. These results indicate that the use of pre-trained state representations has several benefits for solving RL tasks.</p

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Validating the early phototherapy prediction tool across cohorts

Author: Badura Anna
Daunhawer Imant
Michel Holger
Schumacher Kai
Vogt Julia E.
Wellmann Sven
Publication venue: Frontiers
Publication date: 09/10/2023
Field of study

Background: Hyperbilirubinemia of the newborn infant is a common disease worldwide. However, recognized early and treated appropriately, it typically remains innocuous. We recently developed an early phototherapy prediction tool (EPPT) by means of machine learning (ML) utilizing just one bilirubin measurement and few clinical variables. The aim of this study is to test applicability and performance of the EPPT on a new patient cohort from a different population. Materials and methods: This work is a retrospective study of prospectively recorded neonatal data from infants born in 2018 in an academic hospital, Regensburg, Germany, meeting the following inclusion criteria: born with 34 completed weeks of gestation or more, at least two total serum bilirubin (TSB) measurement prior to phototherapy. First, the original EPPT—an ensemble of a logistic regression and a random forest—was used in its freely accessible version and evaluated in terms of the area under the receiver operating characteristic curve (AUROC). Second, a new version of the EPPT model was re-trained on the data from the new cohort. Third, the predictive performance, variable importance, sensitivity and specificity were analyzed and compared across the original and re-trained models. Results: In total, 1,109 neonates were included with a median (IQR) gestational age of 38.4 (36.6–39.9) and a total of 3,940 bilirubin measurements prior to any phototherapy treatment, which was required in 154 neonates (13.9%). For the phototherapy treatment prediction, the original EPPT achieved a predictive performance of 84.6% AUROC on the new cohort. After re-training the model on a subset of the new dataset, 88.8% AUROC was achieved as evaluated by cross validation. The same five variables as for the original model were found to be most important for the prediction on the new cohort, namely gestational age at birth, birth weight, bilirubin to weight ratio, hours since birth, bilirubin value. Discussion: The individual risk for treatment requirement in neonatal hyperbilirubinemia is robustly predictable in different patient cohorts with a previously developed ML tool (EPPT) demanding just one TSB value and only four clinical parameters. Further prospective validation studies are needed to develop an effective and safe clinical decision support system

University of Regensburg Publication Server

Machine learning used to compare the diagnostic accuracy of risk factors, clinical signs and biomarkers and to develop a new prediction model for neonatal early-onset sepsis

Author: Achten Niek B.
Daunhawer Imant
De Mol Amerik C.
De Vries Esther
Donker Albertine E.
Dutta Sourabh
El Helou Salhab
Hoffmann-Haringsma Angelique
Janota Jan
Kornelisse René F.
Lehnick Dirk
Moonen Rob
Plötz Frans B.
Roy Madan
Schlapbach Luregn J.
Schuerman Frank A. B. A.
Sie Sintha D.
Stocker Martin
Tomaske Maren
Van Den Tooren-de Groot Rita K.
Van Der Meer-Kappelle Laura H.
Van Gijsel Juliette
Van Herk Wendy
Van Rossum Annemarie M. C.
Vogt Julia E.
Wellmann Sven
Wieringa Jantien W.
Zimmerman Urs
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/01/2022
Field of study

Background: Current strategies for risk stratification and prediction of neonatal early-onset sepsis (EOS) are inefficient and lack diagnostic performance. The aim of this study was to use machine learning to analyze the diagnostic accuracy of risk factors (RFs), clinical signs and biomarkers and to develop a prediction model for culture-proven EOS. We hypothesized that the contribution to diagnostic accuracy of biomarkers is higher than of RFs or clinical signs. Study Design: Secondary analysis of the prospective international multicenter NeoPInS study. Neonates born after completed 34 weeks of gestation with antibiotic therapy due to suspected EOS within the first 72 hours of life participated. Primary outcome was defined as predictive performance for culture-proven EOS with variables known at the start of antibiotic therapy. Machine learning was used in form of a random forest classifier. Results: One thousand six hundred eighty-five neonates treated for suspected infection were analyzed. Biomarkers were superior to clinical signs and RFs for prediction of culture-proven EOS. C-reactive protein and white blood cells were most important for the prediction of the culture result. Our full model achieved an area-under-the-receiver-operating-characteristic-curve of 83.41% (±8.8%) and an area-under-the-precision-recall-curve of 28.42% (±11.5%). The predictive performance of the model with RFs alone was comparable with random. Conclusions: Biomarkers have to be considered in algorithms for the management of neonates suspected of EOS. A 2-step approach with a screening tool for all neonates in combination with our model in the preselected population with an increased risk for EOS may have the potential to reduce the start of unnecessary antibiotics

EUR Research Repository

Tilburg University Repository

Multimodal Representation Learning under Weak Supervision

Author: Daunhawer Imant
Publication venue: ETH Zurich
Publication date: 01/01/2023
Field of study

Repository for Publications and Research Data

Biases in the Football Betting Market

Author: Daunhawer Imant
Kosub Sven
Schoch David
Publication venue
Publication date: 01/05/2017
Field of study

The University of Manchester - Institutional Repository

MMVAE+: Enhancing the Generative Quality of Multimodal VAEs without Compromises

Author: Daunhawer Imant
Palumbo Emanuele
Vogt Julia E.
Publication venue: OpenReview
Publication date: 01/01/2023
Field of study

Multimodal VAEs have recently gained attention as efficient models for weakly-supervised generative learning with multiple modalities. However, all existing variants of multimodal VAEs are affected by a non-trivial trade-off between generative quality and generative coherence. In particular mixture-based models achieve good coherence only at the expense of sample diversity and a resulting lack of generative quality. We present a novel variant of the mixture-of-experts multimodal variational autoencoder that improves its generative quality, while maintaining high semantic coherence. We model shared and modality-specific information in separate latent subspaces, proposing an objective that overcomes certain dependencies on hyperparameters that arise for existing approaches with the same latent space structure. Compared to these existing approaches, we show increased robustness with respect to changes in the design of the latent space, in terms of the capacity allocated to modality-specific subspaces. We show that our model achieves both good generative coherence and high generative quality in challenging experiments, including more complex multimodal datasets than those used in previous works

Repository for Publications and Research Data

Multimodal Generative Learning Utilizing Jensen-Shannon Divergence

Author: Daunhawer Imant
Sutter Thomas
Vogt Julia E.
Publication venue: Curran
Publication date: 01/01/2021
Field of study

Repository for Publications and Research Data

Multimodal Generative Learning Utilizing Jensen-Shannon Divergence

Author: Daunhawer Imant
Sutter Thomas
Vogt Julia E.
Publication venue: Curran
Publication date: 13/12/2019
Field of study

arXiv.org e-Print Archive

Repository for Publications and Research Data