Search CORE

152 research outputs found

A conjugate prior for discrete hierarchical log-linear models

Author: Dobra Adrian
Liu Jinnan
Massam Hélène
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2009
Field of study

In Bayesian analysis of multi-way contingency tables, the selection of a prior distribution for either the log-linear parameters or the cell probabilities parameters is a major challenge. In this paper, we define a flexible family of conjugate priors for the wide class of discrete hierarchical log-linear models, which includes the class of graphical models. These priors are defined as the Diaconis--Ylvisaker conjugate priors on the log-linear parameters subject to "baseline constraints" under multinomial sampling. We also derive the induced prior on the cell probabilities and show that the induced prior is a generalization of the hyper Dirichlet prior. We show that this prior has several desirable properties and illustrate its usefulness by identifying the most probable decomposable, graphical and hierarchical log-linear models for a six-way contingency table.Comment: Published in at http://dx.doi.org/10.1214/08-AOS669 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

Two-state spin systems with negative interactions

Author: Fei Yumou
Goldberg Leslie
Lu Pinyan
Publication venue: Schloss Dagstuhl
Publication date: 24/01/2024
Field of study

We study the approximability of computing the partition functions of two-state spin systems. The problem is parameterized by a 2 × 2 symmetric matrix. Previous results on this problem were restricted either to the case where the matrix has non-negative entries, or to the case where the diagonal entries are equal, i.e. Ising models. In this paper, we study the generalization to arbitrary 2 × 2 interaction matrices with real entries. We show that in some regions of the parameter space, it’s #P-hard to even determine the sign of the partition function, while in other regions there are fully polynomial approximation schemes for the partition function. Our results reveal several new computational phase transitions

Oxford University Research Archive

Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First

Author: Wang Junxiang
Zhang Zheng
Zhao Liang
Publication venue
Publication date: 28/10/2023
Field of study

Graph Neural Networks (GNNs) have achieved great success in representing data with dependencies by recursively propagating and aggregating messages along the edges. However, edges in real-world graphs often have varying degrees of difficulty, and some edges may even be noisy to the downstream tasks. Therefore, existing GNNs may lead to suboptimal learned representations because they usually treat every edge in the graph equally. On the other hand, Curriculum Learning (CL), which mimics the human learning principle of learning data samples in a meaningful order, has been shown to be effective in improving the generalization ability and robustness of representation learners by gradually proceeding from easy to more difficult samples during training. Unfortunately, existing CL strategies are designed for independent data samples and cannot trivially generalize to handle data dependencies. To address these issues, we propose a novel CL strategy to gradually incorporate more edges into training according to their difficulty from easy to hard, where the degree of difficulty is measured by how well the edges are expected given the model training status. We demonstrate the strength of our proposed method in improving the generalization ability and robustness of learned representations through extensive experiments on nine synthetic datasets and nine real-world datasets. The code for our proposed method is available at https://github.com/rollingstonezz/Curriculum_learning_for_GNNs.Comment: Accepted by NeurIPS 202

arXiv.org e-Print Archive

Interrupting The Propaganda Supply Chain

Author: Bozic Bojan
Hamilton Kyle
Longo Luc
Publication venue: Dublin Institute of Technology
Publication date: 01/04/2021
Field of study

In this early-stage research, a multidisciplinary approach is presented for the detection of propaganda in the media, and for modeling the spread of propaganda and disinformation using semantic web and graph theory. An ontology will be designed which has the theoretical underpinnings from multiple disciplines including the social sciences and epidemiology. An additional objective of this work is to automate triple extraction from unstructured text which surpasses the state-of-the-art performance

Arrow@TUDublin

Recommended from our members

Mini-Workshop: Dimers, Ising and Spanning Trees beyond the Critical Isoradial Case (online meeting)

Author
Publication venue: Zürich : EMS Publ. House
Publication date: 01/01/2020
Field of study

The goal of this mini-workshop is to gather specialists of the dimer, Ising and spanning tree models around recent and ongoing progress in two directions. One is understanding the connection to the spectral curve of these models in the cases when the curve has positive genus. The other is the introduction of universal embeddings associated to these models. We aim to use these new tools to progress in the study of scaling limits

Repositorium für Naturwissenschaften und Technik

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

Author: Bruss C. Bayan
Farivar Reza
Goodsitt Jeremy
Hines Keegan
Truong Anh
Walters Austin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/08/2019
Field of study

There has been considerable growth and interest in industrial applications of machine learning (ML) in recent years. ML engineers, as a consequence, are in high demand across the industry, yet improving the efficiency of ML engineers remains a fundamental challenge. Automated machine learning (AutoML) has emerged as a way to save time and effort on repetitive tasks in ML pipelines, such as data pre-processing, feature engineering, model selection, hyperparameter optimization, and prediction result analysis. In this paper, we investigate the current state of AutoML tools aiming to automate these tasks. We conduct various evaluations of the tools on many datasets, in different data segments, to examine their performance, and compare their advantages and disadvantages on different test cases

arXiv.org e-Print Archive

Crossref

Scipedia

Orbiter rarefied-flow reentry measurements from the OARE on STS-62

Author: Blanchard R. C.
Nicholson J. Y.
Publication venue
Publication date
Field of study

Acceleration data taken from the Orbital Acceleration Research Experiment (OARE) during reentry on STS-62 has been analyzed using calibration factors taken on-orbit. The data includes the flight regime from orbital altitudes down to about 100 km which covers the free-molecule-flow regime and some of the flow-transition into the hypersonic continuum. Ancillary data on orbiter position, orientation, velocity, and rotation rates have been used in models to transform the measured accelerations to the orbiter center-of-gravity, from which aerodynamic accelerations along the orbiter body axes have been calculated. Additional steps are discussed which remove residual offsets introduced in the measurements by unmodeled orbiter forces. The resulting aerodynamic accelerations and their ratios, A(sub z)/A(sub x), are discussed and compared with free-molecule-flow predictions of the aerodynamic coefficient ratios C(sub N)/C(sub A)

NASA Technical Reports Server