
    A study of security issues of mobile apps in the android platform using machine learning approaches

    Mobile apps pose both traditional and new potential threats to system security and user privacy. There are malicious apps that may harm the system, and there are app behaviors that are reasonable and legal when not abused, yet may lead to real threats otherwise. Moreover, due to the nature of mobile apps, an app running on a mobile device may be only part of the software, and its server-side behavior is usually not covered by analysis. Therefore, direct analysis of the app itself may be incomplete, and additional sources of information are needed. In this dissertation, we discuss how machine learning techniques can be applied to multiple security tasks for mobile apps on the Android platform, including malicious app detection and security risk estimation. Both direct sources of information from the app developers and indirect sources of information from user comments are utilized in these tasks. We also compare these different sources in the task of security risk estimation to show the necessity of using indirect sources in mobile app security tasks.
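    A minimal sketch of the general idea, with illustrative feature names and toy data that are not taken from the dissertation: a direct source (declared permissions) and an indirect source (user comments) are fused into a single feature matrix before training a risk classifier.

    # Hypothetical sketch, not the dissertation's actual pipeline.
    import numpy as np
    from scipy.sparse import hstack, csr_matrix
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression

    # Toy data; real features would come from APK analysis and app-store reviews.
    permissions = np.array([
        [1, 1, 0],   # e.g. [SEND_SMS, READ_CONTACTS, CAMERA] (illustrative)
        [0, 0, 1],
        [1, 0, 0],
        [0, 1, 1],
    ])
    comments = [
        "app sends texts without asking, battery drain",
        "great photo filters, works offline",
        "asks for contacts for no reason, feels shady",
        "solid camera app, no complaints",
    ]
    risk_label = np.array([1, 0, 1, 0])              # 1 = risky, 0 = benign

    X_text = TfidfVectorizer().fit_transform(comments)   # indirect source
    X_perm = csr_matrix(permissions)                      # direct source
    X = hstack([X_perm, X_text])                          # fuse both views

    clf = LogisticRegression(max_iter=1000).fit(X, risk_label)
    print(clf.predict(X))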

    Multimodal and Multiscale Deep Neural Networks for the Early Diagnosis of Alzheimer's Disease using structural MR and FDG-PET images.

    Alzheimer's Disease (AD) is a progressive neurodegenerative disease for which biomarkers based on pathophysiology may provide objective measures for diagnosis and staging. Neuroimaging scans acquired with MRI and metabolism images obtained by FDG-PET provide in-vivo measurements of structure and function (glucose metabolism) in the living brain. It is hypothesized that combining multiple image modalities that provide complementary information could help improve early diagnosis of AD. In this paper, we propose a novel deep-learning-based framework to discriminate individuals with AD using a multimodal and multiscale deep neural network. Our method delivers 82.4% accuracy in identifying individuals with mild cognitive impairment (MCI) who will convert to AD three years prior to conversion (86.4% combined accuracy for conversion within 1-3 years), 94.23% sensitivity in classifying individuals with a clinical diagnosis of probable AD, and 86.3% specificity in classifying non-demented controls, improving upon results in the published literature.
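    A minimal sketch of the fusion idea only, assuming pre-extracted feature vectors per modality; the paper's multiscale patch processing is omitted and all dimensions and layer sizes below are placeholders, not the authors' architecture. Each modality gets its own branch, and the branch outputs are concatenated before a shared classification head.

    # Illustrative two-branch multimodal fusion network (assumed layout).
    import torch
    import torch.nn as nn

    class TwoBranchFusion(nn.Module):
        def __init__(self, mri_dim=256, pet_dim=256, hidden=128, n_classes=2):
            super().__init__()
            self.mri_branch = nn.Sequential(nn.Linear(mri_dim, hidden), nn.ReLU())
            self.pet_branch = nn.Sequential(nn.Linear(pet_dim, hidden), nn.ReLU())
            self.head = nn.Sequential(
                nn.Linear(2 * hidden, hidden), nn.ReLU(),
                nn.Linear(hidden, n_classes),
            )

        def forward(self, mri_feat, pet_feat):
            # Fuse the two modality representations by concatenation.
            fused = torch.cat([self.mri_branch(mri_feat),
                               self.pet_branch(pet_feat)], dim=1)
            return self.head(fused)

    model = TwoBranchFusion()
    logits = model(torch.randn(4, 256), torch.randn(4, 256))  # toy batch
    print(logits.shape)  # torch.Size([4, 2])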

    Explainable AI in Fintech and Insurtech

    The growing application of black-box Artificial Intelligence algorithms in many real-world applications is raising the importance of understanding how models make their decisions. The research field that aims to look into the inner workings of the black box and to make predictions more interpretable is referred to as eXplainable Artificial Intelligence (XAI). Over recent years, the research domain of XAI has seen important contributions and continuous development, achieving strong results with theoretically sound applied methodologies. These achievements enable both industry and regulators to improve on existing models and their supervision; this is done in terms of explainability, which is the main purpose of these models, but it also brings new possibilities, namely employing eXplainable AI models and their outputs as an intermediate step toward new applications, greatly expanding their usefulness beyond explaining model decisions. This thesis is composed of six chapters: an introduction and a conclusion plus four self-contained sections reporting the corresponding papers. Chapter 1 proposes the use of Shapley values in similarity networks and clustering models in order to bring out new information, useful for classification and analysis of the customer base, in an insurtech setting. Chapter 2 compares SHAP and LIME, two of the most important XAI models, evaluating their attribution methodologies and the information they are able to convey, in the estimation of the Probability of Default (PD) of Italian Small and Medium Enterprises, with balance sheet data as inputs. Chapter 3 introduces the use of Shapley values in feature selection techniques, analysing wrapper and embedded feature selection algorithms and their ability to select relevant features from both the raw data and the corresponding Shapley values, again in the setting of SME PD estimation. Chapter 4 introduces a new model selection methodology based on the Lorenz Zonoid, highlighting similarities with the game-theoretical concept of Shapley values and their attribution of variability to the independent variables, as well as advantages in terms of model comparability and standardization. These properties are explored through both a simulated example and an application to a real-world dataset provided by the EU-certified rating agency Modefinance.
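    A minimal sketch of Shapley-value-based feature ranking in the spirit of the selection idea above, assuming the shap package and a synthetic dataset in place of the SME balance sheet data; the exact return shape of shap_values can differ across shap versions, so the code guards for it. Features are ranked by mean absolute SHAP value and the top ones kept.

    # Hedged sketch: Shapley-based feature ranking on toy data (not the thesis code).
    import numpy as np
    import shap
    from sklearn.datasets import make_classification
    from sklearn.ensemble import GradientBoostingClassifier

    X, y = make_classification(n_samples=500, n_features=10, n_informative=4,
                               random_state=0)
    model = GradientBoostingClassifier(random_state=0).fit(X, y)

    explainer = shap.TreeExplainer(model)
    sv = np.asarray(explainer.shap_values(X))
    if sv.ndim == 3:                      # some shap versions return per-class stacks
        sv = sv[..., -1] if sv.shape[0] == X.shape[0] else sv[-1]

    importance = np.abs(sv).mean(axis=0)  # mean |SHAP| per feature
    top_features = np.argsort(importance)[::-1][:5]
    print("top features by mean |SHAP|:", top_features)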

    Scalable and Weakly Supervised Bank Transaction Classification

    This paper aims to categorize bank transactions using weak supervision, natural language processing, and deep neural network techniques. Our approach minimizes the reliance on expensive and difficult-to-obtain manual annotations by leveraging heuristics and domain knowledge to train accurate transaction classifiers. We present an effective and scalable end-to-end data pipeline, covering data preprocessing, transaction text embedding, anchoring, label generation, and discriminative neural network training, and give an overview of the system architecture. We demonstrate the effectiveness of our method by showing that it outperforms existing market-leading solutions, achieves accurate categorization, and can be quickly extended to novel and composite use cases. This can in turn unlock many financial applications such as financial health reporting and credit risk assessment.
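    A minimal, self-contained sketch of the weak-supervision idea with made-up keyword heuristics and categories; the paper's anchoring, embedding, and label-generation components are more involved. Heuristic labeling functions vote on each transaction description, the votes become noisy training labels, and a discriminative text classifier is fit on those labels.

    # Illustrative weak-supervision sketch (not the paper's production pipeline).
    import numpy as np
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression

    ABSTAIN, GROCERY, TRANSPORT = -1, 0, 1

    # Labeling functions: cheap heuristics encoding domain knowledge.
    def lf_grocery(text):   return GROCERY if "market" in text or "grocer" in text else ABSTAIN
    def lf_transport(text): return TRANSPORT if "uber" in text or "metro" in text else ABSTAIN
    def lf_fuel(text):      return TRANSPORT if "gas station" in text else ABSTAIN

    transactions = [
        "WHOLE FOODS MARKET 123", "UBER TRIP HELP.UBER.COM",
        "CITY METRO CARD RELOAD", "LOCAL GROCER STORE", "SHELL GAS STATION 42",
    ]

    def weak_label(text):
        votes = [lf(text.lower()) for lf in (lf_grocery, lf_transport, lf_fuel)]
        votes = [v for v in votes if v != ABSTAIN]
        return max(set(votes), key=votes.count) if votes else ABSTAIN

    labels = np.array([weak_label(t) for t in transactions])
    keep = labels != ABSTAIN                       # drop examples no heuristic covers
    X = TfidfVectorizer().fit_transform(np.array(transactions)[keep])
    clf = LogisticRegression(max_iter=1000).fit(X, labels[keep])
    print(clf.predict(X))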

    Using multiple classifiers for predicting the risk of endovascular aortic aneurysm repair re-intervention through hybrid feature selection.

    Feature selection is essential in the medical domain; however, the process becomes complicated in the presence of censoring, the defining characteristic of survival analysis. Most survival feature selection methods are based on Cox's proportional hazards model, even though machine learning classifiers are often preferred; they are less employed in survival analysis because censoring prevents them from being applied directly to survival data. Among the few works that employ machine learning classifiers, the partial logistic artificial neural network with automatic relevance determination is a well-known method that deals with censoring and performs feature selection for survival data. However, it depends on data replication to handle censoring, which leads to unbalanced and biased prediction results, especially on highly censored data; other methods cannot deal with high censoring at all. Therefore, in this article, a new hybrid feature selection method is proposed that offers a solution to high levels of censoring. It combines support vector machine, neural network, and K-nearest neighbor classifiers using simple majority voting and a new weighted majority voting method based on a survival metric to construct a multiple classifier system. The new hybrid feature selection process uses the multiple classifier system as a wrapper method and merges it with an iterated feature ranking filter method to further reduce the features. Two endovascular aortic repair datasets containing 91% censored patients, collected from two centers, were used to construct a multicenter study evaluating the performance of the proposed approach. The results showed that the proposed technique outperformed individual classifiers and variable selection methods based on Cox's model, such as the Akaike and Bayesian information criteria and the least absolute shrinkage and selection operator (LASSO), in terms of the p-values of the log-rank test, sensitivity, and concordance index. This indicates that the proposed classifier is more powerful in correctly predicting the risk of re-intervention, enabling doctors to select patients' future follow-up plans.
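    A minimal sketch of the multiple classifier system used as a wrapper criterion, assuming a toy fully observed dataset and plain accuracy as the score; the article's handling of censoring, its survival-metric-weighted voting, and the iterated filter stage are omitted. An SVM, a neural network, and a KNN vote by simple majority, and the ensemble's cross-validated score evaluates a candidate feature subset.

    # Illustrative majority-voting ensemble as a wrapper criterion (assumed setup).
    from sklearn.datasets import make_classification
    from sklearn.ensemble import VotingClassifier
    from sklearn.model_selection import cross_val_score
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.neural_network import MLPClassifier
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=300, n_features=20, n_informative=5,
                               random_state=0)

    ensemble = VotingClassifier(
        estimators=[("svm", SVC()),
                    ("nn", MLPClassifier(max_iter=2000, random_state=0)),
                    ("knn", KNeighborsClassifier())],
        voting="hard",                      # simple majority voting
    )

    def score_subset(features):
        """Wrapper criterion: cross-validated ensemble accuracy on a feature subset."""
        return cross_val_score(ensemble, X[:, features], y, cv=3).mean()

    print(score_subset([0, 1, 2]), score_subset(list(range(20))))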

    Classification with Large Sparse Datasets: Convergence Analysis and Scalable Algorithms

    Large and sparse datasets, such as user ratings over a large collection of items, are common in the big data era. Many applications need to classify the users or items based on high-dimensional and sparse data vectors, e.g., to predict the profitability of a product or the age group of a user. Linear classifiers are popular choices for classifying such datasets because of their efficiency. In order to classify large sparse data more effectively, the following important questions need to be answered. 1. Sparse data and convergence behavior: how do different properties of a dataset, such as the sparsity rate and the missing-data mechanism, systematically affect the convergence behavior of classification? 2. Handling sparse data with non-linear models: how can non-linear data structures be learned efficiently when classifying large sparse data? This thesis attempts to address these questions with empirical and theoretical analysis of large and sparse datasets.

    We begin by studying the convergence behavior of popular classifiers on large and sparse data. It is known that a classifier gains better generalization ability after learning more and more training examples, eventually converging to the best generalization performance with respect to a given data distribution. In this thesis, we focus on how the sparsity rate and the missing-data mechanism systematically affect such convergence behavior. Our study covers different types of classification models, including a generative classifier and discriminative linear classifiers. To systematically explore the convergence behaviors, we use synthetic data sampled from statistical models of real-world large sparse datasets, and we consider different types of missing-data mechanisms that are common in practice. From the experiments, we make several useful observations about the convergence behavior when classifying large sparse data; based on these observations, we further investigate the theoretical reasons and arrive at a series of useful conclusions. For better applicability, we provide practical guidelines for applying our results in practice. Our study helps to answer whether obtaining more data, or more complete data, is worthwhile in different situations, which is useful for efficient data collection and preparation.

    Despite being efficient, linear classifiers cannot learn non-linear structures, such as low-rankness, in a dataset, so their accuracy may suffer. Meanwhile, most non-linear methods, such as kernel machines, cannot scale to very large and high-dimensional datasets. The third part of this thesis studies how to efficiently learn non-linear structures in large sparse data. Towards this goal, we develop novel scalable feature mappings that can achieve better accuracy than linear classification. We demonstrate that the proposed methods not only outperform linear classification but are also scalable to large and sparse datasets with moderate memory and computation requirements.

    The main contribution of this thesis is to answer important questions about classifying large and sparse datasets. On the one hand, we study the convergence behavior of widely used classifiers under different missing-data mechanisms; on the other hand, we develop efficient methods to learn the non-linear structures in large sparse data and improve classification accuracy. Overall, the thesis not only provides practical guidance on the convergence behavior of classifying large sparse datasets, but also develops highly efficient algorithms for classifying large sparse datasets in practice.
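    A minimal sketch of the contrast described above, assuming synthetic sparse data and scikit-learn's random Fourier features in place of the thesis' proposed mappings: the same linear learner is trained once on the raw sparse features and once on a scalable non-linear feature mapping, and the two test accuracies are printed for comparison.

    # Illustrative comparison: linear classifier vs. kernel-approximation mapping.
    import numpy as np
    import scipy.sparse as sp
    from sklearn.kernel_approximation import RBFSampler
    from sklearn.linear_model import SGDClassifier
    from sklearn.model_selection import train_test_split
    from sklearn.pipeline import make_pipeline

    # Synthetic sparse data: 2000 samples, 500 features, ~2% of entries nonzero.
    X = sp.random(2000, 500, density=0.02, format="csr", random_state=0)
    row_sums = np.asarray(X.sum(axis=1)).ravel()
    y = (row_sums > np.median(row_sums)).astype(int)   # toy binary target

    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    # Baseline: a linear classifier trained directly on the sparse features.
    linear = SGDClassifier(random_state=0).fit(X_tr, y_tr)

    # Scalable non-linear mapping: random Fourier features approximate an RBF
    # kernel while keeping the downstream classifier linear and cheap to train.
    mapped = make_pipeline(RBFSampler(n_components=300, random_state=0),
                           SGDClassifier(random_state=0)).fit(X_tr, y_tr)

    print("linear accuracy:", linear.score(X_te, y_te))
    print("mapped accuracy:", mapped.score(X_te, y_te))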