863 research outputs found
Conditional Sum-Product Networks: Imposing Structure on Deep Probabilistic Architectures
Probabilistic graphical models are a central tool in AI; however, they are
generally not as expressive as deep neural models, and inference is notoriously
hard and slow. In contrast, deep probabilistic models such as sum-product
networks (SPNs) capture joint distributions in a tractable fashion, but still
lack the expressive power of intractable models based on deep neural networks.
Therefore, we introduce conditional SPNs (CSPNs), conditional density
estimators for multivariate and potentially hybrid domains which allow
harnessing the expressive power of neural networks while still maintaining
tractability guarantees. One way to implement CSPNs is to use an existing SPN
structure and condition its parameters on the input, e.g., via a deep neural
network. This approach, however, might misrepresent the conditional
independence structure present in data. Consequently, we also develop a
structure-learning approach that derives both the structure and parameters of
CSPNs from data. Our experimental evidence demonstrates that CSPNs are
competitive with other probabilistic models and yield superior performance on
multilabel image classification compared to mean field and mixture density
networks. Furthermore, they can successfully be employed as building blocks for
structured probabilistic models, such as autoregressive image models.Comment: 13 pages, 6 figure
Recommended from our members
True-false lumen segmentation of aortic dissection using multi-scale wavelet analysis and generative-discriminative model matching
Computer aided diagnosis in the medical image domain requires sophisticated probabilistic models to formulate quantitative behavior in image space. In the diagnostic process detailed knowledge of model performance with respect to accuracy, variability, and uncertainty is crucial. This challenge has lead to the fusion of two successful learning schools namely generative and discriminative learning. In this paper, we propose a generative-discriminative learning approach to predict object boundaries in medical image datasets. In our approach, we perform probabilistic model matching of both modeling domains to fuse into the prediction step appearance and structural information of the object of interest while exploiting the strength of both learning paradigms. In particular, we apply our method to the task of true-false lumen segmentation of aortic dissections an acute disease that requires automated quantification for assisted medical diagnosis. We report empirical results for true-false lumen discrimination of aortic dissection segmentation showing superior behavior of the hybrid generative-discriminative approach over their non hybrid generative counterpart
The Prototyping and Focused Discriminating Strategy for Pattern Recognition and one Instantiation: the MELIDIS System
This paper presents the Prototyping and Focused Discriminating (PFD) strategy for pattern recognition. This strategy takes benefits from the duality between model generation and discrimination. Both collaborate through a focusing mechanism that detects the conflicts between the class models and drive the discrimination. Classifiers based on this collaboration benefit from a set of useful properties. The MĂ©lidis system illustrates this strategy and extends its possibilities, using a fuzzy framework. As shown by experiments, the resulting system provides an interesting compromise between accuracy and compactness. Experiments also demonstrate the interest of the new strategy and of its focusing mechanism
Learning Independent Causal Mechanisms
Statistical learning relies upon data sampled from a distribution, and we
usually do not care what actually generated it in the first place. From the
point of view of causal modeling, the structure of each distribution is induced
by physical mechanisms that give rise to dependences between observables.
Mechanisms, however, can be meaningful autonomous modules of generative models
that make sense beyond a particular entailed data distribution, lending
themselves to transfer between problems. We develop an algorithm to recover a
set of independent (inverse) mechanisms from a set of transformed data points.
The approach is unsupervised and based on a set of experts that compete for
data generated by the mechanisms, driving specialization. We analyze the
proposed method in a series of experiments on image data. Each expert learns to
map a subset of the transformed data back to a reference distribution. The
learned mechanisms generalize to novel domains. We discuss implications for
transfer learning and links to recent trends in generative modeling.Comment: ICML 201
- …