152 research outputs found
A conjugate prior for discrete hierarchical log-linear models
In Bayesian analysis of multi-way contingency tables, the selection of a
prior distribution for either the log-linear parameters or the cell
probabilities parameters is a major challenge. In this paper, we define a
flexible family of conjugate priors for the wide class of discrete hierarchical
log-linear models, which includes the class of graphical models. These priors
are defined as the Diaconis--Ylvisaker conjugate priors on the log-linear
parameters subject to "baseline constraints" under multinomial sampling. We
also derive the induced prior on the cell probabilities and show that the
induced prior is a generalization of the hyper Dirichlet prior. We show that
this prior has several desirable properties and illustrate its usefulness by
identifying the most probable decomposable, graphical and hierarchical
log-linear models for a six-way contingency table.Comment: Published in at http://dx.doi.org/10.1214/08-AOS669 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Two-state spin systems with negative interactions
We study the approximability of computing the partition functions of two-state spin systems. The problem is parameterized by a 2 × 2 symmetric matrix. Previous results on this problem were restricted either to the case where the matrix has non-negative entries, or to the case where the diagonal entries are equal, i.e. Ising models. In this paper, we study the generalization to arbitrary 2 × 2 interaction matrices with real entries. We show that in some regions of the parameter space, it’s #P-hard to even determine the sign of the partition function, while in other regions there are fully polynomial approximation schemes for the partition function. Our results reveal several new computational phase transitions
Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First
Graph Neural Networks (GNNs) have achieved great success in representing data
with dependencies by recursively propagating and aggregating messages along the
edges. However, edges in real-world graphs often have varying degrees of
difficulty, and some edges may even be noisy to the downstream tasks.
Therefore, existing GNNs may lead to suboptimal learned representations because
they usually treat every edge in the graph equally. On the other hand,
Curriculum Learning (CL), which mimics the human learning principle of learning
data samples in a meaningful order, has been shown to be effective in improving
the generalization ability and robustness of representation learners by
gradually proceeding from easy to more difficult samples during training.
Unfortunately, existing CL strategies are designed for independent data samples
and cannot trivially generalize to handle data dependencies. To address these
issues, we propose a novel CL strategy to gradually incorporate more edges into
training according to their difficulty from easy to hard, where the degree of
difficulty is measured by how well the edges are expected given the model
training status. We demonstrate the strength of our proposed method in
improving the generalization ability and robustness of learned representations
through extensive experiments on nine synthetic datasets and nine real-world
datasets. The code for our proposed method is available at
https://github.com/rollingstonezz/Curriculum_learning_for_GNNs.Comment: Accepted by NeurIPS 202
Interrupting The Propaganda Supply Chain
In this early-stage research, a multidisciplinary approach is presented for the detection of propaganda in the media, and for modeling the spread of propaganda and disinformation using semantic web and graph theory. An ontology will be designed which has the theoretical underpinnings from multiple disciplines including the social sciences and epidemiology. An additional objective of this work is to automate triple extraction from unstructured text which surpasses the state-of-the-art performance
Recommended from our members
Mini-Workshop: Dimers, Ising and Spanning Trees beyond the Critical Isoradial Case (online meeting)
The goal of this mini-workshop is to gather specialists of the dimer, Ising and spanning tree models around recent and ongoing progress in two directions. One is understanding the connection to the spectral curve of these models in the cases when the curve has positive genus. The other is the introduction of universal embeddings associated to these models. We aim to use these new tools to progress in the study of scaling limits
Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools
There has been considerable growth and interest in industrial applications of
machine learning (ML) in recent years. ML engineers, as a consequence, are in
high demand across the industry, yet improving the efficiency of ML engineers
remains a fundamental challenge. Automated machine learning (AutoML) has
emerged as a way to save time and effort on repetitive tasks in ML pipelines,
such as data pre-processing, feature engineering, model selection,
hyperparameter optimization, and prediction result analysis. In this paper, we
investigate the current state of AutoML tools aiming to automate these tasks.
We conduct various evaluations of the tools on many datasets, in different data
segments, to examine their performance, and compare their advantages and
disadvantages on different test cases
Orbiter rarefied-flow reentry measurements from the OARE on STS-62
Acceleration data taken from the Orbital Acceleration Research Experiment (OARE) during reentry on STS-62 has been analyzed using calibration factors taken on-orbit. The data includes the flight regime from orbital altitudes down to about 100 km which covers the free-molecule-flow regime and some of the flow-transition into the hypersonic continuum. Ancillary data on orbiter position, orientation, velocity, and rotation rates have been used in models to transform the measured accelerations to the orbiter center-of-gravity, from which aerodynamic accelerations along the orbiter body axes have been calculated. Additional steps are discussed which remove residual offsets introduced in the measurements by unmodeled orbiter forces. The resulting aerodynamic accelerations and their ratios, A(sub z)/A(sub x), are discussed and compared with free-molecule-flow predictions of the aerodynamic coefficient ratios C(sub N)/C(sub A)
- …