17 research outputs found
Forgetting-aware Linear Bias for Attentive Knowledge Tracing
Knowledge Tracing (KT) aims to track proficiency based on a question-solving
history, allowing us to offer a streamlined curriculum. Recent studies actively
utilize attention-based mechanisms to capture the correlation between questions
and combine it with the learner's characteristics for responses. However, our
empirical study shows that existing attention-based KT models neglect the
learner's forgetting behavior, especially as the interaction history becomes
longer. This problem arises from the bias that overprioritizes the correlation
of questions while inadvertently ignoring the impact of forgetting behavior.
This paper proposes a simple-yet-effective solution, namely Forgetting-aware
Linear Bias (FoLiBi), to reflect forgetting behavior as a linear bias. Despite
its simplicity, FoLiBi is readily equipped with existing attentive KT models by
effectively decomposing question correlations with forgetting behavior. FoLiBi
plugged with several KT models yields a consistent improvement of up to 2.58%
in AUC over state-of-the-art KT models on four benchmark datasets.Comment: In Proceedings of the 32nd ACM International Conference on
Information and Knowledge Management (CIKM'23), 5 pages, 3 figures, 2 table
Toward a Better Understanding of Loss Functions for Collaborative Filtering
Collaborative filtering (CF) is a pivotal technique in modern recommender
systems. The learning process of CF models typically consists of three
components: interaction encoder, loss function, and negative sampling. Although
many existing studies have proposed various CF models to design sophisticated
interaction encoders, recent work shows that simply reformulating the loss
functions can achieve significant performance gains. This paper delves into
analyzing the relationship among existing loss functions. Our mathematical
analysis reveals that the previous loss functions can be interpreted as
alignment and uniformity functions: (i) the alignment matches user and item
representations, and (ii) the uniformity disperses user and item distributions.
Inspired by this analysis, we propose a novel loss function that improves the
design of alignment and uniformity considering the unique patterns of datasets
called Margin-aware Alignment and Weighted Uniformity (MAWU). The key novelty
of MAWU is two-fold: (i) margin-aware alignment (MA) mitigates
user/item-specific popularity biases, and (ii) weighted uniformity (WU) adjusts
the significance between user and item uniformities to reflect the inherent
characteristics of datasets. Extensive experimental results show that MF and
LightGCN equipped with MAWU are comparable or superior to state-of-the-art CF
models with various loss functions on three public datasets.Comment: Accepted by CIKM 202
A Rule-based Skyline Computation over a dynamic database
Skyline query which relies on the notion of Pareto dominance filters the data items from a database by ensuring only those data items that are not worse than any others are selected as skylines. However, the dynamic nature of databases in which their states and/or structures change throughout their lifetime to incorporate the current and latest information of database applications, requires a new set of skylines to be derived. Blindly computing skylines on the new state/structure of a database is inefficient, as not all the data items are affected by the changes. Hence, this paper proposes a rule-based approach in tackling the above issue with the main aim at avoiding unnecessary skyline computations. Based on the type of operation that changes the state/structure of a database, i.e. insert/delete/update a data item(s) or add/remove a dimension(s), a set of rules are defined. Besides, the prominent dominance relationships when pairwise comparisons are performed are retained; which are then utilised in the process of computing a new set of skylines. Several analyses have been conducted to evaluate the performance and prove the efficiency of our proposed solution
Query Result Clustering for Object-level Search ∗
Query result clustering has recently attracted a lot of attention to provide users with a succinct overview of relevant results. However, little work has been done on organizing the query results for object-level search. Object-level search result clustering is challenging because we need to support diverse similarity notions over object-specific features (such as the price and weight of a product) of heterogeneous domains. To address this challenge, we propose a hybrid subspace clustering algorithm called Hydra. Algorithm Hydra captures the user perception of diverse similarity notions from millions of Web pages and disambiguates different senses using featurebased subspace locality measures. Our proposed solution, by combining wisdom of crowds and wisdom of data, achieves robustness and efficiency over existing approaches. We extensively evaluate our proposed framework and demonstrate how to enrich user experiences in object-level search using a real-world product search scenarios