5 research outputs found

    Adaptive One-Dimensional Convolutional Neural Network for Tabular Data

    Get PDF
    This study introduces an innovative approach for tackling the credit risk prediction problem using an Adaptive One-Dimensional Convolutional Neural Network (1D CNN). The proposed methodology is designed for one-dimensional data, such as tabular data, through a combination of feed-forward and back-propagation phases. During the feed-forward phase, neuron outputs are computed by applying convolution operations to previous layer outputs, along with bias terms and activation functions. The subsequent back-propagation phase updates weights and biases to minimize prediction errors. A custom weight initialization algorithm tailored to Leaky ReLU activation is employed to enhance model adaptability. The core of the proposed algorithm lies in its ability to process each training data sample across layers, optimizing weights and biases to achieve accurate predictions. Comprehensive evaluations are conducted on various machine learning algorithms, including Gaussian Naive Bayes, Logistic Regression, ensemble methods, and neural networks. The proposed Adaptive 1D CNN emerges as the top performer, consistently surpassing other methods in precision, recall, F1-score, and accuracy. This success is attributed to its specialized weight initialization, effective back-propagation, and integration of 1D convolutional layers

    Bankruptcy prediction model using cost-sensitive extreme gradient boosting in the context of imbalanced datasets

    Get PDF
    In the process of bankruptcy prediction models, a class imbalanced problem has occurred which limits the performance of the models. Most prior research addressed the problem by applying resampling methods such as the synthetic minority oversampling technique (SMOTE). However, resampling methods lead to other issues, e.g., increasing noisy data and training time during the process. To improve the bankruptcy prediction model, we propose cost-sensitive extreme gradient boosting (CS-XGB) to address the class imbalanced problem without requiring any resampling method. The proposed method’s effectiveness is evaluated on six real-world datasets, i.e., the LendingClub, and five Polish companies’ bankruptcy. This research compares the performance of CS-XGB with other ensemble methods, including SMOTE-XGB which applies SMOTE to the training set before the learning process. The experimental results show that i) based on LendingClub, the CS-XGB improves the performance of XGBoost and SMOTE-XGB by more than 50% and 33% on bankruptcy detection rate (BDR) and geometric mean (GM), respectively, and ii) the CS-XGB model outperforms random forest (RF), Bagging, AdaBoost, XGBoost, and SMOTE-XGB in terms of BDR, GM, and the area under a receiver operating characteristic curve (AUC) based on the five Polish datasets. Besides, the CS-XGB model achieves good overall prediction results

    EGMM: an Evidential Version of the Gaussian Mixture Model for Clustering

    Full text link
    The Gaussian mixture model (GMM) provides a convenient yet principled framework for clustering, with properties suitable for statistical inference. In this paper, we propose a new model-based clustering algorithm, called EGMM (evidential GMM), in the theoretical framework of belief functions to better characterize cluster-membership uncertainty. With a mass function representing the cluster membership of each object, the evidential Gaussian mixture distribution composed of the components over the powerset of the desired clusters is proposed to model the entire dataset. The parameters in EGMM are estimated by a specially designed Expectation-Maximization (EM) algorithm. A validity index allowing automatic determination of the proper number of clusters is also provided. The proposed EGMM is as convenient as the classical GMM, but can generate a more informative evidential partition for the considered dataset. Experiments with synthetic and real datasets demonstrate the good performance of the proposed method as compared with some other prototype-based and model-based clustering techniques

    Análisis de determinantes y gestión de riesgos en crowdfunding de préstamos entre pares

    Get PDF
    El objetivo de este estudio fue analizar la tendencia en investigación sobre los determinantes y gestión de riesgos en crowdfunding de préstamos P2P, con el fin de ampliar el conocimiento a inversionistas, empresarios y formuladores de políticas sobre esta financiación disruptiva. Se utilizó un enfoque cualitativo con descripción de publicaciones enfocadas al riesgo crediticio. Seguidamente, se realizó un análisis bibliométrico de la producción científica en las bases WOS y Scopus. El análisis bibliométrico se realizó con las plataformas VOSviewer y RStudio (librerías Bibliometrix y Biblioshiny), en el que se identificaron cuatro clústeres temáticos actuales de investigación que enfocan la producción de conocimiento. A partir del análisis descriptivo se realizó una aproximación teórica con los hallazgos más relevantes. Este estudio concluye que el crowdfunding de préstamos P2P es emergente en Latinoamérica y requiere atención en el riesgo crediticio presente en los prestatarios y en la plataforma en línea, con factores que limitan al inversionista en la identificación de riesgos e interpretación de modelos que los predicen y evalúan, lo cual los expone a altas probabilidades de incumplimiento de pago por parte de los prestatarios. Por ende, es necesario fortalecer la normatividad en el contexto de los países donde se desarrolla, a fin de generar credibilidad y confianza en este mercado disruptivo

    Essays in financial technology: banking efficiency and application of machine learning models in Supply Chain Finance and credit risk assessment

    Get PDF
    The financial landscape is undergoing a significant transformation, driven by technological innovations that are reshaping traditional banking practices. This thesis examines the evolving relationship between financial technology (FinTech) and banking, specifically addressing the credit risk aspects within the domains of Supply Chain Finance (SCF) and peer-to-peer (P2P) lending. FinTech has experienced rapid growth and innovation over the past decade. It encompasses a wide range of technologies and services that aim to enhance and streamline financial processes, disrupt traditional banking models, and offer new solutions to consumers and businesses. The status of FinTech and banking is assessed through an extensive review of the current literature and empirical data. Accordingly, FinTech development has significantly impacted the financial landscape, driving innovation, competition, and customer expectations while it has exposed inefficiencies within traditional banking, it has also compelled banks to evolve and embrace technological advancements. The impact of FinTech on traditional banking models, customer behaviours, and market competition is aimed to be explored. This investigation highlights the challenges and opportunities that arise as FinTech disrupts and reshapes the banking sector, emphasizing its potential to enhance efficiency, accessibility, and customer experiences. As Chapter 3 focuses on an empirical analysis of the impact of FinTech on the operating efficiency of commercial banks in China. Further, in the context of credit risk, the thesis focuses on SCF and P2P lending, two prominent areas influenced by FinTech innovation. SCF has witnessed substantial transformation with the infusion of FinTech solutions. Digital platforms have streamlined the flow of funds within complex supply networks, enhancing the liquidity of suppliers and optimizing working capital for buyers. However, this transformation introduces new credit risk challenges. As suppliers' financial data becomes more accessible, the need for accurate risk assessment and predictive modelling becomes paramount. The integration of big data analytics, machine learning, and artificial intelligence (AI) holds the promise of refining credit risk evaluation by offering real-time insights into supplier financial health, thereby improving lending decisions and reducing defaults. Similarly, P2P lending has redefined the borrowing and lending landscape, enabling direct connections between individual borrowers and lenders. While P2P lending platforms offer speed, convenience, and access to credit for previously underserved segments, they also grapple with credit risk concerns. Evaluating the creditworthiness of individual borrowers without sufficient credit history demands innovative risk assessment methodologies. The emergence of data issues, such as imbalanced data issues, feature selection, and data processing, presents challenges in building accurate credit risk profiles for P2P lending participants. FinTech solutions play a pivotal role in creating and implementing these alternative risk assessment models. Note that, few studies in the literature investigate the benchmark of the advanced method of solving the credit risk assessment in emerging financial services. This thesis aims to address this research gap by evaluating the effectiveness of credit risk assessment models in these FinTech-driven contexts, considering both traditional methodologies and novel data-driven approaches. Chapter 4 investigates the credit risk assessment issue in Digital Supply Chain Finance (DSCF) with the Machine Learning approach and Chapter 5 emphasises the issue of data imbalance of credit risk assessment in P2P Lending. By addressing these gaps and issues, this thesis aims to contribute to the broader discourse on FinTech's role in shaping the future of banking. The findings have implications for financial institutions, policymakers, and regulators seeking to harness the benefits of FinTech while mitigating associated risks. Ultimately, this study offers insights into navigating the evolving landscape of credit risk in SCF and P2P lending within the context of an increasingly technology-driven financial ecosystem
    corecore