49,967 research outputs found
Anomaly Detection In Blockchain
Anomaly detection has been a well-studied area for a long time. Its applications in the financial sector have aided in identifying suspicious activities of hackers. However, with the advancements in the financial domain such as blockchain and artificial intelligence, it is more challenging to deceive financial systems. Despite these technological advancements many fraudulent cases have still emerged.
Many artificial intelligence techniques have been proposed to deal with the anomaly detection problem; some results appear to be considerably assuring, but there is no explicit superior solution. This thesis leaps to bridge the gap between artificial intelligence and blockchain by pursuing various anomaly detection techniques on transactional network data of a public financial blockchain named 'Bitcoin'.
This thesis also presents an overview of the blockchain technology and its application in the financial sector in light of anomaly detection. Furthermore, it extracts the transactional data of bitcoin blockchain and analyses for malicious transactions using unsupervised machine learning techniques. A range of algorithms such as isolation forest, histogram based outlier detection (HBOS), cluster based local outlier factor (CBLOF), principal component analysis (PCA), K-means, deep autoencoder networks and ensemble method are evaluated and compared
HYPA: Efficient Detection of Path Anomalies in Time Series Data on Networks
The unsupervised detection of anomalies in time series data has important
applications in user behavioral modeling, fraud detection, and cybersecurity.
Anomaly detection has, in fact, been extensively studied in categorical
sequences. However, we often have access to time series data that represent
paths through networks. Examples include transaction sequences in financial
networks, click streams of users in networks of cross-referenced documents, or
travel itineraries in transportation networks. To reliably detect anomalies, we
must account for the fact that such data contain a large number of independent
observations of paths constrained by a graph topology. Moreover, the
heterogeneity of real systems rules out frequency-based anomaly detection
techniques, which do not account for highly skewed edge and degree statistics.
To address this problem, we introduce HYPA, a novel framework for the
unsupervised detection of anomalies in large corpora of variable-length
temporal paths in a graph. HYPA provides an efficient analytical method to
detect paths with anomalous frequencies that result from nodes being traversed
in unexpected chronological order.Comment: 11 pages with 8 figures and supplementary material. To appear at SIAM
Data Mining (SDM 2020
Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs
Detecting anomalies in general ledger data is of utmost importance to ensure
trustworthiness of financial records. Financial audits increasingly rely on
machine learning (ML) algorithms to identify irregular or potentially
fraudulent journal entries, each characterized by a varying number of
transactions. In machine learning, heterogeneity in feature dimensions adds
significant complexity to data analysis. In this paper, we introduce a novel
approach to anomaly detection in financial data using Large Language Models
(LLMs) embeddings. To encode non-semantic categorical data from real-world
financial records, we tested 3 pre-trained general purpose sentence-transformer
models. For the downstream classification task, we implemented and evaluated 5
optimized ML models including Logistic Regression, Random Forest, Gradient
Boosting Machines, Support Vector Machines, and Neural Networks. Our
experiments demonstrate that LLMs contribute valuable information to anomaly
detection as our models outperform the baselines, in selected settings even by
a large margin. The findings further underscore the effectiveness of LLMs in
enhancing anomaly detection in financial journal entries, particularly by
tackling feature sparsity. We discuss a promising perspective on using LLM
embeddings for non-semantic data in the financial context and beyond
Enhancing Anomaly Detection in Financial Markets with an LLM-based Multi-Agent Framework
This paper introduces a Large Language Model (LLM)-based multi-agent
framework designed to enhance anomaly detection within financial market data,
tackling the longstanding challenge of manually verifying system-generated
anomaly alerts. The framework harnesses a collaborative network of AI agents,
each specialised in distinct functions including data conversion, expert
analysis via web research, institutional knowledge utilization or
cross-checking and report consolidation and management roles. By coordinating
these agents towards a common objective, the framework provides a comprehensive
and automated approach for validating and interpreting financial data
anomalies. I analyse the S&P 500 index to demonstrate the framework's
proficiency in enhancing the efficiency, accuracy and reduction of human
intervention in financial market monitoring. The integration of AI's autonomous
functionalities with established analytical methods not only underscores the
framework's effectiveness in anomaly detection but also signals its broader
applicability in supporting financial market monitoring
Deep learning for time series anomaly detection:A survey
Time series anomaly detection is important for a wide range of research fields and applications, including financial markets, economics, earth sciences, manufacturing, and healthcare. The presence of anomalies can indicate novel or unexpected events, such as production faults, system defects, and heart palpitations, and is therefore of particular interest. The large size and complexity of patterns in time series data have led researchers to develop specialised deep learning models for detecting anomalous patterns. This survey provides a structured and comprehensive overview of state-of-the-art deep learning for time series anomaly detection. It provides a taxonomy based on anomaly detection strategies and deep learning models. Aside from describing the basic anomaly detection techniques in each category, their advantages and limitations are also discussed. Furthermore, this study includes examples of deep anomaly detection in time series across various application domains in recent years. Finally, it summarises open issues in research and challenges faced while adopting deep anomaly detection models to time series data
Deep Semi-Supervised Anomaly Detection for Finding Fraud in the Futures Market
Modern financial electronic exchanges are an exciting and fast-paced
marketplace where billions of dollars change hands every day. They are also
rife with manipulation and fraud. Detecting such activity is a major
undertaking, which has historically been a job reserved exclusively for humans.
Recently, more research and resources have been focused on automating these
processes via machine learning and artificial intelligence. Fraud detection is
overwhelmingly associated with the greater field of anomaly detection, which is
usually performed via unsupervised learning techniques because of the lack of
labeled data needed for supervised learning. However, a small quantity of
labeled data does often exist. This research article aims to evaluate the
efficacy of a deep semi-supervised anomaly detection technique, called Deep
SAD, for detecting fraud in high-frequency financial data. We use exclusive
proprietary limit order book data from the TMX exchange in Montr\'eal, with a
small set of true labeled instances of fraud, to evaluate Deep SAD against its
unsupervised predecessor. We show that incorporating a small amount of labeled
data into an unsupervised anomaly detection framework can greatly improve its
accuracy.Comment: 8 pages, 3 figure
- …