8,662 research outputs found
Learning from Multi-View Multi-Way Data via Structural Factorization Machines
Real-world relations among entities can often be observed and determined by
different perspectives/views. For example, the decision made by a user on
whether to adopt an item relies on multiple aspects such as the contextual
information of the decision, the item's attributes, the user's profile and the
reviews given by other users. Different views may exhibit multi-way
interactions among entities and provide complementary information. In this
paper, we introduce a multi-tensor-based approach that can preserve the
underlying structure of multi-view data in a generic predictive model.
Specifically, we propose structural factorization machines (SFMs) that learn
the common latent spaces shared by multi-view tensors and automatically adjust
the importance of each view in the predictive model. Furthermore, the
complexity of SFMs is linear in the number of parameters, which make SFMs
suitable to large-scale problems. Extensive experiments on real-world datasets
demonstrate that the proposed SFMs outperform several state-of-the-art methods
in terms of prediction accuracy and computational cost.Comment: 10 page
Hierarchical and Incremental Structural Entropy Minimization for Unsupervised Social Event Detection
As a trending approach for social event detection, graph neural network
(GNN)-based methods enable a fusion of natural language semantics and the
complex social network structural information, thus showing SOTA performance.
However, GNN-based methods can miss useful message correlations. Moreover, they
require manual labeling for training and predetermining the number of events
for prediction. In this work, we address social event detection via graph
structural entropy (SE) minimization. While keeping the merits of the GNN-based
methods, the proposed framework, HISEvent, constructs more informative message
graphs, is unsupervised, and does not require the number of events given a
priori. Specifically, we incrementally explore the graph neighborhoods using
1-dimensional (1D) SE minimization to supplement the existing message graph
with edges between semantically related messages. We then detect events from
the message graph by hierarchically minimizing 2-dimensional (2D) SE. Our
proposed 1D and 2D SE minimization algorithms are customized for social event
detection and effectively tackle the efficiency problem of the existing SE
minimization algorithms. Extensive experiments show that HISEvent consistently
outperforms GNN-based methods and achieves the new SOTA for social event
detection under both closed- and open-set settings while being efficient and
robust.Comment: Accepted to AAAI 202
Prediction of triple point fermions in simple half-Heusler topological insulators
We predict the existence of triple point fermions in the band structure of
several half-Heusler topological insulators by calculations and the
Kane model. We find that many half-Heusler compounds exhibit multiple triple
points along four independent axes, through which the doubly degenerate
conduction bands and the nondegenerate valence band cross each other linearly
nearby the Fermi energy. When projected from the bulk to the (111) surface,
most of these triple points are located far away from the surface
point, as distinct from previously reported triple point fermion
candidates. These isolated triple points give rise to Fermi arcs on the
surface, that can be readily detected by photoemission spectroscopy or scanning
tunneling spectroscopy.Comment: 6 pages, 3 figures. The supplementary information is attached in the
latex packag
Modeling relation paths for knowledge base completion via joint adversarial training
Knowledge Base Completion (KBC), which aims at determining the missing
relations between entity pairs, has received increasing attention in recent
years. Most existing KBC methods focus on either embedding the Knowledge Base
(KB) into a specific semantic space or leveraging the joint probability of
Random Walks (RWs) on multi-hop paths. Only a few unified models take both
semantic and path-related features into consideration with adequacy. In this
paper, we propose a novel method to explore the intrinsic relationship between
the single relation (i.e. 1-hop path) and multi-hop paths between paired
entities. We use Hierarchical Attention Networks (HANs) to select important
relations in multi-hop paths and encode them into low-dimensional vectors. By
treating relations and multi-hop paths as two different input sources, we use a
feature extractor, which is shared by two downstream components (i.e. relation
classifier and source discriminator), to capture shared/similar information
between them. By joint adversarial training, we encourage our model to extract
features from the multi-hop paths which are representative for relation
completion. We apply the trained model (except for the source discriminator) to
several large-scale KBs for relation completion. Experimental results show that
our method outperforms existing path information-based approaches. Since each
sub-module of our model can be well interpreted, our model can be applied to a
large number of relation learning tasks.Comment: Accepted by Knowledge-Based System
Next-to-leading order QCD predictions for associated production at the CERN Large Hadron Collider
We calculate the complete next-to-leading-order (NLO) QCD corrections
(including SUSY QCD corrections) to the inclusive total cross sections of the
associated production processes in the minimal
supersymmetric standard model (MSSM) at the CERN Large Hadron Collider (LHC).
Our results show that the enhancement of the total cross sections from the NLO
QCD corrections can reach for 200 GeV GeV and
. The scale dependence of the total cross section is improved by
the NLO corrections, which is less than 5%. We also show the Monte Carlo
simulation results for the signature including the
complete NLO QCD effects, and find an observable signature above the standard
model (SM) background for a normal luminosity of 100 fb at the LHC.Comment: Published version in Phys.Rev.
Joint Learning of Local and Global Features for Aspect-based Sentiment Classification
Aspect-based sentiment classification (ASC) aims to judge the sentiment
polarity conveyed by the given aspect term in a sentence. The sentiment
polarity is not only determined by the local context but also related to the
words far away from the given aspect term. Most recent efforts related to the
attention-based models can not sufficiently distinguish which words they should
pay more attention to in some cases. Meanwhile, graph-based models are coming
into ASC to encode syntactic dependency tree information. But these models do
not fully leverage syntactic dependency trees as they neglect to incorporate
dependency relation tag information into representation learning effectively.
In this paper, we address these problems by effectively modeling the local and
global features. Firstly, we design a local encoder containing: a Gaussian mask
layer and a covariance self-attention layer. The Gaussian mask layer tends to
adjust the receptive field around aspect terms adaptively to deemphasize the
effects of unrelated words and pay more attention to local information. The
covariance self-attention layer can distinguish the attention weights of
different words more obviously. Furthermore, we propose a dual-level graph
attention network as a global encoder by fully employing dependency tag
information to capture long-distance information effectively. Our model
achieves state-of-the-art performance on both SemEval 2014 and Twitter
datasets.Comment: under revie
- …