578 research outputs found
XAI in the Audit Domain - Explaining an Autoencoder Model for Anomaly Detection
Detecting erroneous or fraudulent business transactions andcorresponding journal entries imposes a significant challenge for auditors duringannual audits. One possible solution to cope with these problems is the use ofmachine learning methods, such as an autoencoder, to identify unusual journalentries within individual financial accounts. There are several methods for theinterpretation of such black-box models, summarized under the term eXplainableArtificial Intelligence (XAI), but these are not suitable for autoencoders. This paperproposes an approach for interpreting autoencoders, which consists of labelingthe journal entries first using the autoencoder and then training models suitablefor the application of XAI methods using these labels. The results obtained areevaluated with the help of human auditors, showing that an autoencoder model is not onlyable to capture relevant features of the domain but also provides additionalvaluable insights for identifying anomalous journal entries
Detecting credit card fraud: An analysis of fraud detection techniques
Advancements in the modern age have brought many conveniences, one of those being credit cards. Providing an individual the ability to hold their entire purchasing power in the form of pocket-sized plastic cards have made credit cards the preferred method to complete financial transactions. However, these systems are not infallible and may provide criminals and other bad actors the opportunity to abuse them. Financial institutions and their customers lose billions of dollars every year to credit card fraud. To combat this issue, fraud detection systems are deployed to discover fraudulent activity after they have occurred. Such systems rely on advanced machine learning techniques and other supportive algorithms to detect and prevent fraud in the future. This work analyzes the various machine learning techniques for their ability to efficiently detect fraud and explores additional state-of-the-art techniques to assist with their performance. This work also proposes a generalized strategy to detect fraud regardless of a dataset\u27s features or unique characteristics. The high performing models discovered through this generalized strategy lay the foundation to build additional models based on state-of-the-art methods. This work expands on the issues of fraud detection, such as missing data and unbalanced datasets, and highlights models that combat these issues. Furthermore, state-of-the-art techniques, such as adapting to concept drift, are employed to combat fraud adaptation
Machine learning and deep learning
Today, intelligent systems that offer artificial intelligence capabilities
often rely on machine learning. Machine learning describes the capacity of
systems to learn from problem-specific training data to automate the process of
analytical model building and solve associated tasks. Deep learning is a
machine learning concept based on artificial neural networks. For many
applications, deep learning models outperform shallow machine learning models
and traditional data analysis approaches. In this article, we summarize the
fundamentals of machine learning and deep learning to generate a broader
understanding of the methodical underpinning of current intelligent systems. In
particular, we provide a conceptual distinction between relevant terms and
concepts, explain the process of automated analytical model building through
machine learning and deep learning, and discuss the challenges that arise when
implementing such intelligent systems in the field of electronic markets and
networked business. These naturally go beyond technological aspects and
highlight issues in human-machine interaction and artificial intelligence
servitization.Comment: Published online first in Electronic Market
Email fraud classifier using machine learning
Treballs Finals de Grau d'Enginyeria Informà tica, Facultat de Matemà tiques, Universitat de Barcelona, Any: 2020, Director: Jordi José Bazán[en] Email is one of the most common methods of communication nowadays. Programs known as malware detection are essential to assist and protect users from the agents that are usually responsible for cyberattacks. This paper focuses on using machine learning algorithms to detect any possible email attacks by analyzing datasets of whitelists and blacklists. This document also includes other methods that try to solve this problem
Cost-Sensitive Selective Classification and its Applications to Online Fraud Management
abstract: Fraud is defined as the utilization of deception for illegal gain by hiding the true nature of the activity. While organizations lose around $3.7 trillion in revenue due to financial crimes and fraud worldwide, they can affect all levels of society significantly. In this dissertation, I focus on credit card fraud in online transactions. Every online transaction comes with a fraud risk and it is the merchant's liability to detect and stop fraudulent transactions. Merchants utilize various mechanisms to prevent and manage fraud such as automated fraud detection systems and manual transaction reviews by expert fraud analysts. Many proposed solutions mostly focus on fraud detection accuracy and ignore financial considerations. Also, the highly effective manual review process is overlooked. First, I propose Profit Optimizing Neural Risk Manager (PONRM), a selective classifier that (a) constitutes optimal collaboration between machine learning models and human expertise under industrial constraints, (b) is cost and profit sensitive. I suggest directions on how to characterize fraudulent behavior and assess the risk of a transaction. I show that my framework outperforms cost-sensitive and cost-insensitive baselines on three real-world merchant datasets. While PONRM is able to work with many supervised learners and obtain convincing results, utilizing probability outputs directly from the trained model itself can pose problems, especially in deep learning as softmax output is not a true uncertainty measure. This phenomenon, and the wide and rapid adoption of deep learning by practitioners brought unintended consequences in many situations such as in the infamous case of Google Photos' racist image recognition algorithm; thus, necessitated the utilization of the quantified uncertainty for each prediction. There have been recent efforts towards quantifying uncertainty in conventional deep learning methods (e.g., dropout as Bayesian approximation); however, their optimal use in decision making is often overlooked and understudied. Thus, I present a mixed-integer programming framework for selective classification called MIPSC, that investigates and combines model uncertainty and predictive mean to identify optimal classification and rejection regions. I also extend this framework to cost-sensitive settings (MIPCSC) and focus on the critical real-world problem, online fraud management and show that my approach outperforms industry standard methods significantly for online fraud management in real-world settings.Dissertation/ThesisDoctoral Dissertation Computer Science 201
Application of an Artificial Neural Network as a Third-Party Database Auditing System
Data auditing is a fundamental challenge for organizations who deal with large databases. Databases are frequently targeted by attacks that grow in quantity and sophistication every day, and one-third of which are coming from users inside the organizations. Database auditing plays a vital role in protecting against these attacks. Native features in data base auditing systems monitor and capture activities and incidents that occur within a database and notify the database administrator. However, the cost of administration and performance overhead in the software must be considered. As opposed to using native auditing tools, the better solution for having a more secure database is to utilize third-party products. The primary goal of this thesis is to utilize an efficient and optimized deep learning approach to detect suspicious behaviors within a database by calculating the amount of risk that each user poses for the system. This will be accomplished by using an Artificial Neural Network as an enhanced feature of analyzer component of a database auditing system. This ANN will work as a third-party product for the database auditing system. The model has been validated in order to have a low bias and low variance. Moreover, parameter tuning technique has been utilized to find the best parameters that would result in the highest accuracy for the model
Machine Learning Algorithm to Detect Impersonation in an Essay-Based E-Exam
Essay-based E-exams require answers to be written out at some length in an E- learning platform The questions require a response with multiple paragraphs and should be logical and well-structured These type of examinations are increasingly becoming popular in academic institutions of higher learning based on the experience of COVID-19 pandemic Since the exam is mainly done virtually with reduced supervision the risk of impersonation and stolen content from other sources increases Due to this there is need to design cost effective and accurate techniques that are able to detect cheating in an essay based E- exa
Forensic accounting and Cybersecurity examine their interrelation in the detection and Prevention of financial fraud
Due to the increase in the number of financial crimes and the increase in the number of cases related to this type of crimes in the courts, and because the task falls on the shoulders of a judicial accountant, the classical methods no longer work because of the complexity of cases and the increase in their number, and because the development of technology has contributed to the increase in the number of these crimes, also contributed with techniques through artificial intelligence in data analysis and easy access to solving these crimes, and this study aims to identify the effectiveness of machine learning in criminal accounting and its ability to facilitate the work of judicial accountants vıa Fraud detection is one of the main applications of artificial intelligence and machine learning in forensic accounting. By analyzing large data sets, machine learning algorithms can identify patterns and anomalies that may indicate fraudulent activity. These algorithms can also learn from previous cases and improve their accuracy over time ,Machine learning algorithms can identify irregularities and inconsistencies that may indicate financial crimes such as money laundering, embezzlement and tax fraud and is used in predictive analytics, allowing forensic accountants to anticipate possible financial crimes before they occu
Unveiling the frontiers of deep learning: innovations shaping diverse domains
Deep learning (DL) enables the development of computer models that are
capable of learning, visualizing, optimizing, refining, and predicting data. In
recent years, DL has been applied in a range of fields, including audio-visual
data processing, agriculture, transportation prediction, natural language,
biomedicine, disaster management, bioinformatics, drug design, genomics, face
recognition, and ecology. To explore the current state of deep learning, it is
necessary to investigate the latest developments and applications of deep
learning in these disciplines. However, the literature is lacking in exploring
the applications of deep learning in all potential sectors. This paper thus
extensively investigates the potential applications of deep learning across all
major fields of study as well as the associated benefits and challenges. As
evidenced in the literature, DL exhibits accuracy in prediction and analysis,
makes it a powerful computational tool, and has the ability to articulate
itself and optimize, making it effective in processing data with no prior
training. Given its independence from training data, deep learning necessitates
massive amounts of data for effective analysis and processing, much like data
volume. To handle the challenge of compiling huge amounts of medical,
scientific, healthcare, and environmental data for use in deep learning, gated
architectures like LSTMs and GRUs can be utilized. For multimodal learning,
shared neurons in the neural network for all activities and specialized neurons
for particular tasks are necessary.Comment: 64 pages, 3 figures, 3 table
Multimodal Document Analytics for Banking Process Automation
In response to growing FinTech competition and the need for improved
operational efficiency, this research focuses on understanding the potential of
advanced document analytics, particularly using multimodal models, in banking
processes. We perform a comprehensive analysis of the diverse banking document
landscape, highlighting the opportunities for efficiency gains through
automation and advanced analytics techniques in the customer business. Building
on the rapidly evolving field of natural language processing (NLP), we
illustrate the potential of models such as LayoutXLM, a cross-lingual,
multimodal, pre-trained model, for analyzing diverse documents in the banking
sector. This model performs a text token classification on German company
register extracts with an overall F1 score performance of around 80\%. Our
empirical evidence confirms the critical role of layout information in
improving model performance and further underscores the benefits of integrating
image information. Interestingly, our study shows that over 75% F1 score can be
achieved with only 30% of the training data, demonstrating the efficiency of
LayoutXLM. Through addressing state-of-the-art document analysis frameworks,
our study aims to enhance process efficiency and demonstrate the real-world
applicability and benefits of multimodal models within banking.Comment: A Preprin
- …