Search CORE

9,174 research outputs found

KGAT: Knowledge Graph Attention Network for Recommendation

Author: Cao Yixin
Chua Tat-Seng
He Xiangnan
Liu Meng
Wang Xiang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 07/06/2019
Field of study

To provide more accurate, diverse, and explainable recommendation, it is compulsory to go beyond modeling user-item interactions and take side information into account. Traditional methods like factorization machine (FM) cast it as a supervised learning problem, which assumes each interaction as an independent instance with side information encoded. Due to the overlook of the relations among instances or items (e.g., the director of a movie is also an actor of another movie), these methods are insufficient to distill the collaborative signal from the collective behaviors of users. In this work, we investigate the utility of knowledge graph (KG), which breaks down the independent interaction assumption by linking items with their attributes. We argue that in such a hybrid structure of KG and user-item graph, high-order relations --- which connect two items with one or multiple linked attributes --- are an essential factor for successful recommendation. We propose a new method named Knowledge Graph Attention Network (KGAT) which explicitly models the high-order connectivities in KG in an end-to-end fashion. It recursively propagates the embeddings from a node's neighbors (which can be users, items, or attributes) to refine the node's embedding, and employs an attention mechanism to discriminate the importance of the neighbors. Our KGAT is conceptually advantageous to existing KG-based recommendation methods, which either exploit high-order relations by extracting paths or implicitly modeling them with regularization. Empirical results on three public benchmarks show that KGAT significantly outperforms state-of-the-art methods like Neural FM and RippleNet. Further studies verify the efficacy of embedding propagation for high-order relation modeling and the interpretability benefits brought by the attention mechanism.Comment: KDD 2019 research trac

arXiv.org e-Print Archive

Crossref

Institutional Knowledge at Singapore Management University

ScholarBank@NUS

Joint Intermodal and Intramodal Label Transfers for Extremely Rare or Unseen Classes

Author: Aggarwal Charu
Huang Thomas
Liu Wei
Qi Guo-Jun
Publication venue
Publication date: 22/03/2017
Field of study

In this paper, we present a label transfer model from texts to images for image classification tasks. The problem of image classification is often much more challenging than text classification. On one hand, labeled text data is more widely available than the labeled images for classification tasks. On the other hand, text data tends to have natural semantic interpretability, and they are often more directly related to class labels. On the contrary, the image features are not directly related to concepts inherent in class labels. One of our goals in this paper is to develop a model for revealing the functional relationships between text and image features as to directly transfer intermodal and intramodal labels to annotate the images. This is implemented by learning a transfer function as a bridge to propagate the labels between two multimodal spaces. However, the intermodal label transfers could be undermined by blindly transferring the labels of noisy texts to annotate images. To mitigate this problem, we present an intramodal label transfer process, which complements the intermodal label transfer by transferring the image labels instead when relevant text is absent from the source corpus. In addition, we generalize the inter-modal label transfer to zero-shot learning scenario where there are only text examples available to label unseen classes of images without any positive image examples. We evaluate our algorithm on an image classification task and show the effectiveness with respect to the other compared algorithms.Comment: The paper has been accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence. It will apear in a future issu

arXiv.org e-Print Archive

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

User Modeling and User Profiling: A Comprehensive Survey

Author: Boratto Ludovico
De Luca Ernesto William
Purificato Erasmo
Publication venue
Publication date: 20/02/2024
Field of study

The integration of artificial intelligence (AI) into daily life, particularly through information retrieval and recommender systems, has necessitated advanced user modeling and profiling techniques to deliver personalized experiences. These techniques aim to construct accurate user representations based on the rich amounts of data generated through interactions with these systems. This paper presents a comprehensive survey of the current state, evolution, and future directions of user modeling and profiling research. We provide a historical overview, tracing the development from early stereotype models to the latest deep learning techniques, and propose a novel taxonomy that encompasses all active topics in this research area, including recent trends. Our survey highlights the paradigm shifts towards more sophisticated user profiling methods, emphasizing implicit data collection, multi-behavior modeling, and the integration of graph data structures. We also address the critical need for privacy-preserving techniques and the push towards explainability and fairness in user modeling approaches. By examining the definitions of core terminology, we aim to clarify ambiguities and foster a clearer understanding of the field by proposing two novel encyclopedic definitions of the main terms. Furthermore, we explore the application of user modeling in various domains, such as fake news detection, cybersecurity, and personalized education. This survey serves as a comprehensive resource for researchers and practitioners, offering insights into the evolution of user modeling and profiling and guiding the development of more personalized, ethical, and effective AI systems.Comment: 71 page

arXiv.org e-Print Archive

A review of wildland fire spread modelling, 1990-present 3: Mathematical analogues and simulation models

Author: Albinet
Albini
Albini
Anderson
Andrew L. Sullivan
Asensio
Asensio
Bak
Bak
Ball
Beer
Beer
Beer
Berjak
Butler
Caldarelli
Catchpole
Chen
Cheney
Cheney
Clar
Clark
Clark
Clark
Clarke
Coen
Coleman
Corral
Curry
Dercole
Drossel
Drossel
Drossel
Duarte
Eklund
Enting
Favier
Finney
French
Fujioka
Gardner
Graham
Grassberger
Green
Green
Green
Hargrove
Hern�ndez Encinas
Hern�ndez Encinas
Hogeweg
Johnston
Karafyllidis
Karafyllidis
Karafyllidis
Karplus
King
Knight
Kourtz
Langton
Li
Lopes
Loreto
Malamud
Malamud
McAlpine
Mendez
Mraz
Muzy
Muzy
Nahmias
Nelson
Noble
Ntaimo
Pastor-Satorras
Perry
Pielke
Plourde
Porterie
Pruessner
Reed
Rhodes
Richards
Richards
Richards
Ricotta
Schenk
Schenk
Ser�n
Speer
Sullivan
Sullivan
Taplin
Vakalis
Van Wagner
Van Wagner
Vasconcelos
von Niessen
Wallace
Watt
Wolfram
Publication venue: 'CSIRO Publishing'
Publication date: 27/06/2007
Field of study

In recent years, advances in computational power and spatial data analysis (GIS, remote sensing, etc) have led to an increase in attempts to model the spread and behvaiour of wildland fires across the landscape. This series of review papers endeavours to critically and comprehensively review all types of surface fire spread models developed since 1990. This paper reviews models of a simulation or mathematical analogue nature. Most simulation models are implementations of existing empirical or quasi-empirical models and their primary function is to convert these generally one dimensional models to two dimensions and then propagate a fire perimeter across a modelled landscape. Mathematical analogue models are those that are based on some mathematical conceit (rather than a physical representation of fire spread) that coincidentally simulates the spread of fire. Other papers in the series review models of an physical or quasi-physical nature and empirical or quasi-empirical nature. Many models are extensions or refinements of models developed before 1990. Where this is the case, these models are also discussed but much less comprehensively.Comment: 20 pages + 9 pages references + 1 page figures. Submitted to the International Journal of Wildland Fir

arXiv.org e-Print Archive

Crossref

How did the discussion go: Discourse act classification in social media conversations

Author: B O’Connor
J Bollen
K Scott
Mirko Lai
ML Larson
S Bhatia
S Hochreiter
S Hochreiter
Subhabrata Dutta
T Chakraborty
V Eisenlauer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/08/2018
Field of study

We propose a novel attention based hierarchical LSTM model to classify discourse act sequences in social media conversations, aimed at mining data from online discussion using textual meanings beyond sentence level. The very uniqueness of the task is the complete categorization of possible pragmatic roles in informal textual discussions, contrary to extraction of question-answers, stance detection or sarcasm identification which are very much role specific tasks. Early attempt was made on a Reddit discussion dataset. We train our model on the same data, and present test results on two different datasets, one from Reddit and one from Facebook. Our proposed model outperformed the previous one in terms of domain independence; without using platform-dependent structural features, our hierarchical LSTM with word relevance attention mechanism achieved F1-scores of 71\% and 66\% respectively to predict discourse roles of comments in Reddit and Facebook discussions. Efficiency of recurrent and convolutional architectures in order to learn discursive representation on the same task has been presented and analyzed, with different word and comment embedding schemes. Our attention mechanism enables us to inquire into relevance ordering of text segments according to their roles in discourse. We present a human annotator experiment to unveil important observations about modeling and data annotation. Equipped with our text-based discourse identification model, we inquire into how heterogeneous non-textual features like location, time, leaning of information etc. play their roles in charaterizing online discussions on Facebook

arXiv.org e-Print Archive

Crossref