5 research outputs found
History Repeats: Overcoming Catastrophic Forgetting For Event-Centric Temporal Knowledge Graph Completion
Temporal knowledge graph (TKG) completion models typically rely on having
access to the entire graph during training. However, in real-world scenarios,
TKG data is often received incrementally as events unfold, leading to a dynamic
non-stationary data distribution over time. While one could incorporate
fine-tuning to existing methods to allow them to adapt to evolving TKG data,
this can lead to forgetting previously learned patterns. Alternatively,
retraining the model with the entire updated TKG can mitigate forgetting but is
computationally burdensome. To address these challenges, we propose a general
continual training framework that is applicable to any TKG completion method,
and leverages two key ideas: (i) a temporal regularization that encourages
repurposing of less important model parameters for learning new knowledge, and
(ii) a clustering-based experience replay that reinforces the past knowledge by
selectively preserving only a small portion of the past data. Our experimental
results on widely used event-centric TKG datasets demonstrate the effectiveness
of our proposed continual training framework in adapting to new events while
reducing catastrophic forgetting. Further, we perform ablation studies to show
the effectiveness of each component of our proposed framework. Finally, we
investigate the relation between the memory dedicated to experience replay and
the benefit gained from our clustering-based sampling strategy.Comment: 14 pages, 6 figure
Identifying and Analyzing Cryptocurrency Manipulations in Social Media
Interest surrounding cryptocurrencies, digital or virtual currencies that are
used as a medium for financial transactions, has grown tremendously in recent
years. The anonymity surrounding these currencies makes investors particularly
susceptible to fraud---such as "pump and dump" scams---where the goal is to
artificially inflate the perceived worth of a currency, luring victims into
investing before the fraudsters can sell their holdings. Because of the speed
and relative anonymity offered by social platforms such as Twitter and
Telegram, social media has become a preferred platform for scammers who wish to
spread false hype about the cryptocurrency they are trying to pump. In this
work we propose and evaluate a computational approach that can automatically
identify pump and dump scams as they unfold by combining information across
social media platforms. We also develop a multi-modal approach for predicting
whether a particular pump attempt will succeed or not. Finally, we analyze the
prevalence of bots in cryptocurrency related tweets, and observe a significant
increase in bot activity during the pump attempts.Comment: Section 4. Prediction tasks: The training setup and algorithm
revised. The details of the training algorithm added. More features added to
the feature set. Section 5. Botometer score added as the likelihood of a user
being bot. More analysis added on bot activity in cluster
Relational Learning to Capture the Dynamics and Sparsity of Knowledge Graphs
The rapid growth of large scale event datasets with timestamps has given rise to the dynamically evolving multi-relational knowledge graphs. Temporal reasoning over such data brings on many challenges and is still not well understood. Most real-world knowledge graphs are characterized by a long-tail relation frequency distribution where a significant fraction of relations occurs only a handful of times. This observation has given rise to the recent interest in low-shot learning methods that are able to generalize from only a few examples. The existing approaches, however, are tailored to static knowledge graphs and not easily generalized to temporal settings, where data scarcity poses even bigger problems, due to the occurrence of new, previously unseen relations. The goal of my doctoral research is to introduce new approaches for learning meaningful representation that capture the dynamics of temporal knowledge graphs while tackling various existing challenges such as data scarcity