578 research outputs found

    XAI in the Audit Domain - Explaining an Autoencoder Model for Anomaly Detection

    Get PDF
    Detecting erroneous or fraudulent business transactions andcorresponding journal entries imposes a significant challenge for auditors duringannual audits. One possible solution to cope with these problems is the use ofmachine learning methods, such as an autoencoder, to identify unusual journalentries within individual financial accounts. There are several methods for theinterpretation of such black-box models, summarized under the term eXplainableArtificial Intelligence (XAI), but these are not suitable for autoencoders. This paperproposes an approach for interpreting autoencoders, which consists of labelingthe journal entries first using the autoencoder and then training models suitablefor the application of XAI methods using these labels. The results obtained areevaluated with the help of human auditors, showing that an autoencoder model is not onlyable to capture relevant features of the domain but also provides additionalvaluable insights for identifying anomalous journal entries

    Detecting credit card fraud: An analysis of fraud detection techniques

    Get PDF
    Advancements in the modern age have brought many conveniences, one of those being credit cards. Providing an individual the ability to hold their entire purchasing power in the form of pocket-sized plastic cards have made credit cards the preferred method to complete financial transactions. However, these systems are not infallible and may provide criminals and other bad actors the opportunity to abuse them. Financial institutions and their customers lose billions of dollars every year to credit card fraud. To combat this issue, fraud detection systems are deployed to discover fraudulent activity after they have occurred. Such systems rely on advanced machine learning techniques and other supportive algorithms to detect and prevent fraud in the future. This work analyzes the various machine learning techniques for their ability to efficiently detect fraud and explores additional state-of-the-art techniques to assist with their performance. This work also proposes a generalized strategy to detect fraud regardless of a dataset\u27s features or unique characteristics. The high performing models discovered through this generalized strategy lay the foundation to build additional models based on state-of-the-art methods. This work expands on the issues of fraud detection, such as missing data and unbalanced datasets, and highlights models that combat these issues. Furthermore, state-of-the-art techniques, such as adapting to concept drift, are employed to combat fraud adaptation

    Machine learning and deep learning

    Full text link
    Today, intelligent systems that offer artificial intelligence capabilities often rely on machine learning. Machine learning describes the capacity of systems to learn from problem-specific training data to automate the process of analytical model building and solve associated tasks. Deep learning is a machine learning concept based on artificial neural networks. For many applications, deep learning models outperform shallow machine learning models and traditional data analysis approaches. In this article, we summarize the fundamentals of machine learning and deep learning to generate a broader understanding of the methodical underpinning of current intelligent systems. In particular, we provide a conceptual distinction between relevant terms and concepts, explain the process of automated analytical model building through machine learning and deep learning, and discuss the challenges that arise when implementing such intelligent systems in the field of electronic markets and networked business. These naturally go beyond technological aspects and highlight issues in human-machine interaction and artificial intelligence servitization.Comment: Published online first in Electronic Market

    Email fraud classifier using machine learning

    Get PDF
    Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2020, Director: Jordi José Bazán[en] Email is one of the most common methods of communication nowadays. Programs known as malware detection are essential to assist and protect users from the agents that are usually responsible for cyberattacks. This paper focuses on using machine learning algorithms to detect any possible email attacks by analyzing datasets of whitelists and blacklists. This document also includes other methods that try to solve this problem

    Cost-Sensitive Selective Classification and its Applications to Online Fraud Management

    Get PDF
    abstract: Fraud is defined as the utilization of deception for illegal gain by hiding the true nature of the activity. While organizations lose around $3.7 trillion in revenue due to financial crimes and fraud worldwide, they can affect all levels of society significantly. In this dissertation, I focus on credit card fraud in online transactions. Every online transaction comes with a fraud risk and it is the merchant's liability to detect and stop fraudulent transactions. Merchants utilize various mechanisms to prevent and manage fraud such as automated fraud detection systems and manual transaction reviews by expert fraud analysts. Many proposed solutions mostly focus on fraud detection accuracy and ignore financial considerations. Also, the highly effective manual review process is overlooked. First, I propose Profit Optimizing Neural Risk Manager (PONRM), a selective classifier that (a) constitutes optimal collaboration between machine learning models and human expertise under industrial constraints, (b) is cost and profit sensitive. I suggest directions on how to characterize fraudulent behavior and assess the risk of a transaction. I show that my framework outperforms cost-sensitive and cost-insensitive baselines on three real-world merchant datasets. While PONRM is able to work with many supervised learners and obtain convincing results, utilizing probability outputs directly from the trained model itself can pose problems, especially in deep learning as softmax output is not a true uncertainty measure. This phenomenon, and the wide and rapid adoption of deep learning by practitioners brought unintended consequences in many situations such as in the infamous case of Google Photos' racist image recognition algorithm; thus, necessitated the utilization of the quantified uncertainty for each prediction. There have been recent efforts towards quantifying uncertainty in conventional deep learning methods (e.g., dropout as Bayesian approximation); however, their optimal use in decision making is often overlooked and understudied. Thus, I present a mixed-integer programming framework for selective classification called MIPSC, that investigates and combines model uncertainty and predictive mean to identify optimal classification and rejection regions. I also extend this framework to cost-sensitive settings (MIPCSC) and focus on the critical real-world problem, online fraud management and show that my approach outperforms industry standard methods significantly for online fraud management in real-world settings.Dissertation/ThesisDoctoral Dissertation Computer Science 201

    Application of an Artificial Neural Network as a Third-Party Database Auditing System

    Get PDF
    Data auditing is a fundamental challenge for organizations who deal with large databases. Databases are frequently targeted by attacks that grow in quantity and sophistication every day, and one-third of which are coming from users inside the organizations. Database auditing plays a vital role in protecting against these attacks. Native features in data base auditing systems monitor and capture activities and incidents that occur within a database and notify the database administrator. However, the cost of administration and performance overhead in the software must be considered. As opposed to using native auditing tools, the better solution for having a more secure database is to utilize third-party products. The primary goal of this thesis is to utilize an efficient and optimized deep learning approach to detect suspicious behaviors within a database by calculating the amount of risk that each user poses for the system. This will be accomplished by using an Artificial Neural Network as an enhanced feature of analyzer component of a database auditing system. This ANN will work as a third-party product for the database auditing system. The model has been validated in order to have a low bias and low variance. Moreover, parameter tuning technique has been utilized to find the best parameters that would result in the highest accuracy for the model

    Machine Learning Algorithm to Detect Impersonation in an Essay-Based E-Exam

    Get PDF
    Essay-based E-exams require answers to be written out at some length in an E- learning platform The questions require a response with multiple paragraphs and should be logical and well-structured These type of examinations are increasingly becoming popular in academic institutions of higher learning based on the experience of COVID-19 pandemic Since the exam is mainly done virtually with reduced supervision the risk of impersonation and stolen content from other sources increases Due to this there is need to design cost effective and accurate techniques that are able to detect cheating in an essay based E- exa

    Forensic accounting and Cybersecurity examine their interrelation in the detection and Prevention of financial fraud

    Get PDF
    Due to the increase in the number of financial crimes and the increase in the number of cases related to this type of crimes in the courts, and because the task falls on the shoulders of a judicial accountant, the classical methods no longer work because of the complexity of cases and the increase in their number, and because the development of technology has contributed to the increase in the number of these crimes, also contributed with techniques through artificial intelligence in data analysis and easy access to solving these crimes, and this study aims to identify the effectiveness of machine learning in criminal accounting and its ability to facilitate the work of judicial accountants vıa Fraud detection is one of the main applications of artificial intelligence and machine learning in forensic accounting. By analyzing large data sets, machine learning algorithms can identify patterns and anomalies that may indicate fraudulent activity. These algorithms can also learn from previous cases and improve their accuracy over time ,Machine learning algorithms can identify irregularities and inconsistencies that may indicate financial crimes such as money laundering, embezzlement and tax fraud and is used in predictive analytics, allowing forensic accountants to anticipate possible financial crimes before they occu

    Unveiling the frontiers of deep learning: innovations shaping diverse domains

    Full text link
    Deep learning (DL) enables the development of computer models that are capable of learning, visualizing, optimizing, refining, and predicting data. In recent years, DL has been applied in a range of fields, including audio-visual data processing, agriculture, transportation prediction, natural language, biomedicine, disaster management, bioinformatics, drug design, genomics, face recognition, and ecology. To explore the current state of deep learning, it is necessary to investigate the latest developments and applications of deep learning in these disciplines. However, the literature is lacking in exploring the applications of deep learning in all potential sectors. This paper thus extensively investigates the potential applications of deep learning across all major fields of study as well as the associated benefits and challenges. As evidenced in the literature, DL exhibits accuracy in prediction and analysis, makes it a powerful computational tool, and has the ability to articulate itself and optimize, making it effective in processing data with no prior training. Given its independence from training data, deep learning necessitates massive amounts of data for effective analysis and processing, much like data volume. To handle the challenge of compiling huge amounts of medical, scientific, healthcare, and environmental data for use in deep learning, gated architectures like LSTMs and GRUs can be utilized. For multimodal learning, shared neurons in the neural network for all activities and specialized neurons for particular tasks are necessary.Comment: 64 pages, 3 figures, 3 table

    Multimodal Document Analytics for Banking Process Automation

    Full text link
    In response to growing FinTech competition and the need for improved operational efficiency, this research focuses on understanding the potential of advanced document analytics, particularly using multimodal models, in banking processes. We perform a comprehensive analysis of the diverse banking document landscape, highlighting the opportunities for efficiency gains through automation and advanced analytics techniques in the customer business. Building on the rapidly evolving field of natural language processing (NLP), we illustrate the potential of models such as LayoutXLM, a cross-lingual, multimodal, pre-trained model, for analyzing diverse documents in the banking sector. This model performs a text token classification on German company register extracts with an overall F1 score performance of around 80\%. Our empirical evidence confirms the critical role of layout information in improving model performance and further underscores the benefits of integrating image information. Interestingly, our study shows that over 75% F1 score can be achieved with only 30% of the training data, demonstrating the efficiency of LayoutXLM. Through addressing state-of-the-art document analysis frameworks, our study aims to enhance process efficiency and demonstrate the real-world applicability and benefits of multimodal models within banking.Comment: A Preprin
    • …
    corecore