58,535 research outputs found
Hierarchic Bayesian models for kernel learning
The integration of diverse forms of informative data by learning an optimal combination of base kernels in classification or regression problems can provide enhanced performance when compared to that obtained from any single data source. We present a Bayesian hierarchical model which enables kernel learning and present effective variational Bayes estimators for regression and classification. Illustrative experiments demonstrate the utility of the proposed method
Variational Bayesian multinomial probit regression with Gaussian process priors
It is well known in the statistics literature that augmenting binary and polychotomous response models with Gaussian latent variables enables exact Bayesian analysis via Gibbs sampling from the parameter posterior. By adopting such a data augmentation strategy, dispensing with priors over regression coefficients in favour of Gaussian Process (GP) priors over functions, and employing variational approximations to the full posterior we obtain efficient computational methods for Gaussian Process classification in the multi-class setting. The model augmentation with additional latent variables ensures full a posteriori class coupling whilst retaining the simple a priori independent GP covariance structure from which sparse approximations, such as multi-class Informative Vector Machines (IVM), emerge in a very natural and straightforward manner. This is the first time that a fully Variational Bayesian treatment for multi-class GP classification has been developed without having to resort to additional explicit approximations to the non-Gaussian likelihood term. Empirical comparisons with exact analysis via MCMC and Laplace approximations illustrate the utility of the variational approximation as a computationally economic alternative to full MCMC and it is shown to be more accurate than the Laplace approximation
Resource allocation using constraint propagation
The concept of constraint propagation was discussed. Performance increases are possible with careful application of these constraint mechanisms. The degree of performance increase is related to the interdependence of the different activities resource usage. Although this method of applying constraints to activities and resources is often beneficial, it is obvious that this is no panacea cure for the computational woes that are experienced by dynamic resource allocation and scheduling problems. A combined effort for execution optimization in all areas of the system during development and the selection of the appropriate development environment is still the best method of producing an efficient system
Efficient state-space inference of periodic latent force models
Latent force models (LFM) are principled approaches to incorporating solutions to differen-tial equations within non-parametric inference methods. Unfortunately, the developmentand application of LFMs can be inhibited by their computational cost, especially whenclosed-form solutions for the LFM are unavailable, as is the case in many real world prob-lems where these latent forces exhibit periodic behaviour. Given this, we develop a newsparse representation of LFMs which considerably improves their computational efficiency,as well as broadening their applicability, in a principled way, to domains with periodic ornear periodic latent forces. Our approach uses a linear basis model to approximate onegenerative model for each periodic force. We assume that the latent forces are generatedfrom Gaussian process priors and develop a linear basis model which fully expresses thesepriors. We apply our approach to model the thermal dynamics of domestic buildings andshow that it is effective at predicting day-ahead temperatures within the homes. We alsoapply our approach within queueing theory in which quasi-periodic arrival rates are mod-elled as latent forces. In both cases, we demonstrate that our approach can be implemented efficiently using state-space methods which encode the linear dynamic systems via LFMs.Further, we show that state estimates obtained using periodic latent force models can re-duce the root mean squared error to 17% of that from non-periodic models and 27% of thenearest rival approach which is the resonator model (S ̈arkk ̈a et al., 2012; Hartikainen et al.,2012.
A Framework for Quantifying the Degeneracies of Exoplanet Interior Compositions
Several transiting super-Earths are expected to be discovered in the coming
few years. While tools to model the interior structure of transiting planets
exist, inferences about the composition are fraught with ambiguities. We
present a framework to quantify how much we can robustly infer about
super-Earth and Neptune-size exoplanet interiors from radius and mass
measurements. We introduce quaternary diagrams to illustrate the range of
possible interior compositions for planets with four layers (iron core,
silicate mantles, water layers, and H/He envelopes). We apply our model to
CoRoT-7b, GJ 436b, and HAT-P-11b. Interpretation of planets with H/He envelopes
is limited by the model uncertainty in the interior temperature, while for
CoRoT-7b observational uncertainties dominate. We further find that our planet
interior model sharpens the observational constraints on CoRoT-7b's mass and
radius, assuming the planet does not contain significant amounts of water or
gas. We show that the strength of the limits that can be placed on a
super-Earth's composition depends on the planet's density; for similar
observational uncertainties, high-density super-Mercuries allow the tightest
composition constraints. Finally, we describe how techniques from Bayesian
statistics can be used to take into account in a formal way the combined
contributions of both theoretical and observational uncertainties to
ambiguities in a planet's interior composition. On the whole, with only a mass
and radius measurement an exact interior composition cannot be inferred for an
exoplanet because the problem is highly underconstrained. Detailed quantitative
ranges of plausible compositions, however, can be found.Comment: 20 pages, 10 figures, published in Ap
Gasdermins in Apoptosis: New players in an Old Game.
Apoptosis is a form of programmed cell death (PCD) that plays critical physiological roles in removing superfluous or dangerous cell populations that are unneeded or threatening to the health of the host organism. Although the molecular pathways leading to activation of the apoptotic program have been extensively studied and characterized starting in the 1970s, new evidence suggests that members of the gasdermin superfamily are novel pore-forming proteins that augment apoptosis by permeabilizing the mitochondria and participate in the final stages of the apoptotic program by inducing secondary necrosis/pyroptosis. These findings may explain outstanding questions in the field such as why certain gasdermin members sensitize cells to apoptosis, and why some apoptotic cells also show morphological features of necrosis. Furthermore, the interplay between the gasdermins and apoptosis may also explain why genetic and epigenetic alterations in these genes cause diseases and disorders like cancer and hearing loss. This review focuses on our current understanding of the function of several gasdermin superfamily members, their role in apoptosis, and how they may contribute to pathophysiological conditions
Semi-parametric analysis of multi-rater data
Datasets that are subjectively labeled by a number of experts are becoming more common in tasks such as biological text annotation where class definitions are necessarily somewhat subjective. Standard classification and regression models are not suited to multiple labels and typically a pre-processing step (normally assigning the majority class) is performed. We propose Bayesian models for classification and ordinal regression that naturally incorporate multiple expert opinions in defining predictive distributions. The models make use of Gaussian process priors, resulting in great flexibility and particular suitability to text based problems where the number of covariates can be far greater than the number of data instances. We show that using all labels rather than just the majority improves performance on a recent biological dataset
- …