1,135 research outputs found
A bank customer credit evaluation based on the decision tree and the simulated annealing algorithm
C4.5 is a learning algorithm that adopts local search strategy, and it cannot obtain the best decision rules. On the other hand, the simulated annealing algorithm is a globally optimized algorithm and it avoids the drawbacks of C4.5. This paper proposes a new credit evaluation method based on decision tree and simulated annealing algorithm. The experimental results demonstrate that the proposed method is effective. Ā© 2008 IEEE
Recommended from our members
An Overview of the Use of Neural Networks for Data Mining Tasks
In the recent years the area of data mining has experienced a considerable demand for technologies that extract knowledge from large and complex data sources. There is a substantial commercial interest as well as research investigations in the area that aim to develop new and improved approaches for extracting information, relationships, and patterns from datasets. Artificial Neural Networks (NN) are popular biologically inspired intelligent methodologies, whose classification, prediction and pattern recognition capabilities have been utilised successfully in many areas, including science, engineering, medicine, business, banking, telecommunication, and many other fields. This paper highlights from a data mining perspective the implementation of NN, using supervised and unsupervised learning, for pattern recognition, classification, prediction and cluster analysis, and focuses the discussion on their usage in bioinformatics and financial data analysis tasks
The Feasibility of Credit Using C4.5 Algorithm Based on Particle Swarm Optimization Prediction
Credit is a belief that one is given to a person or other entity which is concerned in the future will fulfill all the obligations previously agreed. The objective of research is necessary to do credit analysis to determine the feasibility of a credit crunch, through credit analysis results, it can be seen whether the customer is feasible or not. The methods are is used to predict credit worthiness is by using two models, models classification algorithm C4.5 and C4.5 classification algorithm model based Particle Swarm Optimization (PSO). After testing with these two models found that the result C4.5 classification algorithm generates a value of 90.99% accuracy and AUC value of 0.911 to the level diagnostics Classification Excellent, but after the optimization with C4.5 classification algorithm based on Particle Swarm Optimization accuracy values amounted to 91.18% and the AUC value of 0.913 to the level of diagnosis Excellent Classification. These both methods have different accuracy level of 0.18%
Recommended from our members
A Spatio-Temporal Bayesian Network Classifier for Understanding Visual Field Deterioration
Progressive loss of the field of vision is characteristic of a number of eye diseases
such as glaucoma which is a leading cause of irreversible blindness in the world. Recently,
there has been an explosion in the amount of data being stored on patients who suffer from visual deterioration including field test data, retinal image data and patient demographic data. However, there has been relatively little work in modelling
the spatial and temporal relationships common to such data. In this paper we introduce a novel method for classifying Visual Field (VF) data that explicitly models these spatial and temporal relationships. We carry out an analysis of this
method and compare it to a number of classifiers from the machine learning and statistical communities. Results are very encouraging showing that our classifiers are comparable to existing statistical models whilst also facilitating the understanding of underlying spatial and temporal relationships within VF data. The results
reveal the potential of using such models for knowledge discovery within ophthalmic databases, such as networks reflecting the ānasal stepā, an early indicator of the onset of glaucoma. The results outlined in this paper pave the way for a substantial program of study involving many other spatial and temporal datasets, including retinal image and clinical data
Optimizing Indoor Navigation Policies For Spatial Distancing
In this paper, we focus on the modification of policies that can lead to
movement patterns and directional guidance of occupants, which are represented
as agents in a 3D simulation engine. We demonstrate an optimization method that
improves a spatial distancing metric by modifying the navigation graph by
introducing a measure of spatial distancing of agents as a function of agent
density (i.e., occupancy). Our optimization framework utilizes such metrics as
the target function, using a hybrid approach of combining genetic algorithm and
simulated annealing. We show that within our framework, the
simulation-optimization process can help to improve spatial distancing between
agents by optimizing the navigation policies for a given indoor environment.Comment: 9 pages, 8 figures, conference-- simulation in architecture and urban
design, in-cooperation with ACM SIGSI
Credit Scoring Based on Hybrid Data Mining Classification
The credit scoring has been regarded as a critical topic. This study proposed four approaches combining with the NN (Neural Network) classifier for features selection that retains sufficient information for classification purpose. Two UCI data sets and different approaches combined with NN classifier were constructed by selecting features. NN classifier combines with conventional statistical LDA, Decision tree, Rough set and F-score approaches as features preprocessing step to optimize feature space by removing both irrelevant and redundant features. The procedure of the proposed algorithm is described first and then evaluated by their performances. The results are compared in combination with NN classifier and nonparametric Wilcoxon signed rank test will be held to show if there has any significant difference between these approaches. Our results suggest that hybrid credit scoring models are robust and effective in finding optimal subsets and the compound procedure is a promising method to the fields of data mining
An academic review: applications of data mining techniques in finance industry
With the development of Internet techniques, data volumes are doubling every two years, faster than predicted by Mooreās Law. Big Data Analytics becomes particularly important for enterprise business. Modern computational technologies will provide effective tools to help understand hugely accumulated data and leverage this information to get insights into the finance industry. In order to get actionable insights into the business, data has become most valuable asset of financial organisations, as there are no physical products in finance industry to manufacture. This is where data mining techniques come to their rescue by allowing access to the right information at the right time. These techniques are used by the finance industry in various areas such as fraud detection, intelligent forecasting, credit rating, loan management, customer profiling, money laundering, marketing and prediction of price movements to name a few. This work aims to survey the research on data mining techniques applied to the finance industry from 2010 to 2015.The review finds that Stock prediction and Credit rating have received most attention of researchers, compared to Loan prediction, Money Laundering and Time Series prediction. Due to the dynamics, uncertainty and variety of data, nonlinear mapping techniques have been deeply studied than linear techniques. Also it has been proved that hybrid methods are more accurate in prediction, closely followed by Neural Network technique. This survey could provide a clue of applications of data mining techniques for finance industry, and a summary of methodologies for researchers in this area. Especially, it could provide a good vision of Data Mining Techniques in computational finance for beginners who want to work in the field of computational finance
- ā¦