83 research outputs found
Fuzzy and smote resampling technique for imbalanced data sets
There are many factors that could affect the performance of a classifier.One of these factors is having imbalanced datasets which could lead to problem in classification accuracy.In binary classification, classifier often ignores instances in minority class.Resampling technique, specifically, undersampling and oversampling are the techniques that are commonly used to overcome the problem related to imbalanced data sets. In this study, an integration of undersampling and oversampling techniques is proposed to
improve classification accuracy.The proposed technique is an integration between Fuzzy Distance-based Undersampling and SMOTE.The findings from the study indicate that the proposed combination technique is able to produce more balanced datasets to improve the classification accuracy
A conceptual model of enhanced undersampling technique
Imbalanced datasets often lead to decrement of classifiers’ performance.Undersampling technique is
one of the approaches that is used when dealing with
imbalanced datasets problem.This paper discusses on
the advantages and disadvantages of several
undersampling techniques.An enhanced Distancebased
undersampling technique is proposed to balance the imbalanced data that will be used for classification. The fuzzy logic has been integrated in the distance-based undersampling technique to resolve the ambiguity and bias issues
Grid load balancing using enhance ant colony optimization
This study presents a new algorithm based on ant colony
optimization for load balancing management in grid computing. The concentration is on improving the way ants search the best resources in terms of minimizing the processing time of each job and at the same time balancing the workload on available resources. An enhanced technique is proposed for the pheromone update activities. Single colony of ants is used for searching the best resources to process jobs. The credibility of the proposed algorithm was tested with other load balancing algorithim and results showed that the proposed algorithm was able to balance the load on the resources
Prapemprosesan data menggunakan teknik tapisan dalam pemodelan perceptron multi aras
This purpose of this research is to study the effect of filtering technique on house price modeling performance in predicting of learning capability of Multilayer Perceptron. This study also considered the comparison between neural network data testing modeling and multiple regression. Data is not only based on house price index but also includes other aspects which are related directly or indirectly to the house price index. From the finding, it is learnt that by using filtering technique, better prediction in house price modeling was obtained
Investigating teacher's integrity through association rule mining
The selection of teachers to attend trainings is currently done randomly, by rotation and not based on their work performance.This poses a problem in selecting the right teacher to attend the right course.Up until now, there is no intelligent model to assist the school management to determine the integrity level of teacher and assign them to the right training program.Thus, this study investigates the integrity traits of teacher using association rule technique with an aim, which can assist the school management to organize a training related to teachers’ integrity performance and to avoid sending the wrong teacher for the training.A dataset of Trainees Integrity Dataset representing 1500 secondary school teachers in Langkawi Island, Malaysia in the year 2009 were pre-processed and mined using apriori. Mining knowledge was analyzed based on demographic and integrity trait of teacher.The finding indicates that adaptability and stability are the weakest integrity trait among teachers.Teachers from the age group of 26 - 30 years are found to have lower integrity performance.However, other
demographic factor such as gender, race, and grade position of teachers were not able to reflect their low integrity level in this study.Finally, this study produces a component of trainee selection module which can be used as guideline for school management to propose suitable training programs for teacher to improve their integrity mainly on adaptability and stability traits
Semantic network representation of female related issues from the Holy Quran
Quran is the main source of knowledge and has been a major source reference for all types of problems.Female is one of terms that are very popular in the Quran.This term represents a group of human that are venerable in the aspect of social and family.Quran contains many important issues related to the female. However understanding the
issues and the solution from the Quran is difficult due to lack of understanding of Quran literature.Furthermore, female has been addressed in the Quran through several other terms such as women,lady and girls depending on the issues that being addressed.In this paper, 16 terms that are related to female and its related surah and verse have been identified.Semantic network is used to represent the issues. This study also found that, semantic network representation provide a clear and brief overview of the issue that is easy to understand and comprehend
Ant colony optimization algorithm for load balancing in grid computing
Managing resources in grid computing system is complicated due to the distributed and heterogeneous nature of the resources. This research proposes an enhancement of the ant colony optimization algorithm that caters for dynamic scheduling and load balancing in the grid computing system. The proposed algorithm is known as the enhance ant colony optimization (EACO). The algorithm consists of three new mechanisms that organize the work of an ant colony i.e. initial pheromone value mechanism, resource selection mechanism and pheromone update mechanism. The resource allocation problem is modelled as a graph that can be used by the ant to deliver its pheromone.This graph consists of four types of vertices which are job, requirement, resource and capacity that are used in constructing the grid resource management element. The proposed EACO algorithm takes into consideration the capacity of resources and the characteristics of jobs in determining the best resource to process a job. EACO selects the resources based on the pheromone value on each resource which is recorded in a matrix form. The initial pheromone value of each resource for each job is calculated based on the estimated transmission time and execution time of a given job.Resources with high pheromone value are selected to process the submitted jobs. Global pheromone update is performed after the completion of processing the jobs in order to reduce the pheromone value of resources.A simulation environment was developed using Java programming to test the performance of the proposed EACO algorithm against other ant based algorithm, in terms of resource utilization. Experimental results show that EACO produced better grid resource management solution
Resource management in grid computing using ant colony optimization
Managing resources in grid computing system is complicated due to the distributed and heterogeneous nature of the resources.Stagnation in grid computing system may occur when all jobs require or are assigned to the same resources which lead to the resources having high workload or the time taken to process a job is high.This research proposes an Enhanced Ant Colony Optimization (EACO) algorithm that caters dynamic scheduling and load balancing in the grid computing system.The algorithm consists of three new mechanisms that organize the work of an ant colony i.e. initial pheromone value mechanism, resource selection mechanism and pheromone update mechanism.The resource allocation problem is modeled as a graph that can be used by the ant to deliver its pheromone.This graph consists of four types of vertices which are job, requirement, resource and capacity that are used in constructing the grid resource management element.The proposed EACO algorithm takes into consideration the capacity of resources and the characteristics of jobs in determining the best resource to process a job.EACO selects the resources based on the pheromone value on each resource which is recorded in a matrix form.The initial pheromone value of each resource for each job is calculated based on the estimated transmission time and execution time of a given job. Resources with high pheromone value are selected to process the submitted jobs.Global pheromone update is performed after completion processing the jobs in order to reduce the pheromone value of resources.A simulation environment was developed using Java programming to test the performance of the proposed EACO algorithm against other ant based algorithm, in terms of resource utilization.Experimental results show that EACO produced better grid resource management solution
Weakest integrity traits identification of teachers using association rule mining
The government has arranged many programs for teacher development however the training is organized to fit yearly calendar without considering the right teacher for the right training.The selection of teacher to attend training is done randomly, by rotation and not based on their work performance.This paper investigate the weakest integrity trait of teacher using association rule technique with the aim can assists the school management to organize training related to teachers integrity performance and avoid sending a wrong teacher for a training.A dataset of Trainees Integrity Dataset (TID) representing 1500 secondary school teachers in Langkawi Island, Malaysia in the year 2009 were pre-processed and mined using apriori. The knowledge from the mining was analyzed based on demographic and integrity trait of teacher.The finding indicates that adaptability and stability are the weakest integrity trait among teachers.Besides that, the analysis also unable to prove that demographic factor such as the age and gender of teachers reflect their low integrity performance.The finding can be a guideline for school management to propose a suitable training program for teacher to improve integrity mainly at the adaptability and stability trait
Fuzzy distance-based undersampling technique for imbalanced flood data
Performances of classifiers are affected by imbalanced data because instances in the minority
class are often ignored. Imbalanced data often occur in many application domains including flood. If flood cases are misclassified, the impact of flood is higher than the misclassification of non-flood cases.Numerous resampling techniques such as
undersampling and oversampling have been used to overcome the problem of misclassification of
imbalanced data.However, the undersampling and
oversampling techniques suffer from elimination of
relevant data and overfitting, which may lead to
poor classification results.This paper proposes a
Fuzzy Distance-based Undersampling (FDUS) technique to increase classification accuracy. Entropy estimation is used to generate fuzzy
thresholds which are used to categorise the
instances in majority and minority classes into
membership functions. The performance of FDUS
was compared with three techniques based on Fmeasure and G-mean, experimented on flood data.
From the results, FDUS achieved better F-measure
and G-mean compared to the other techniques
which showed that the FDUS was able to reduce
the elimination of relevant data
- …