83 research outputs found

    Fuzzy and smote resampling technique for imbalanced data sets

    Get PDF
    There are many factors that could affect the performance of a classifier.One of these factors is having imbalanced datasets which could lead to problem in classification accuracy.In binary classification, classifier often ignores instances in minority class.Resampling technique, specifically, undersampling and oversampling are the techniques that are commonly used to overcome the problem related to imbalanced data sets. In this study, an integration of undersampling and oversampling techniques is proposed to improve classification accuracy.The proposed technique is an integration between Fuzzy Distance-based Undersampling and SMOTE.The findings from the study indicate that the proposed combination technique is able to produce more balanced datasets to improve the classification accuracy

    A conceptual model of enhanced undersampling technique

    Get PDF
    Imbalanced datasets often lead to decrement of classifiers’ performance.Undersampling technique is one of the approaches that is used when dealing with imbalanced datasets problem.This paper discusses on the advantages and disadvantages of several undersampling techniques.An enhanced Distancebased undersampling technique is proposed to balance the imbalanced data that will be used for classification. The fuzzy logic has been integrated in the distance-based undersampling technique to resolve the ambiguity and bias issues

    Grid load balancing using enhance ant colony optimization

    Get PDF
    This study presents a new algorithm based on ant colony optimization for load balancing management in grid computing. The concentration is on improving the way ants search the best resources in terms of minimizing the processing time of each job and at the same time balancing the workload on available resources. An enhanced technique is proposed for the pheromone update activities. Single colony of ants is used for searching the best resources to process jobs. The credibility of the proposed algorithm was tested with other load balancing algorithim and results showed that the proposed algorithm was able to balance the load on the resources

    Prapemprosesan data menggunakan teknik tapisan dalam pemodelan perceptron multi aras

    Get PDF
    This purpose of this research is to study the effect of filtering technique on house price modeling performance in predicting of learning capability of Multilayer Perceptron. This study also considered the comparison between neural network data testing modeling and multiple regression. Data is not only based on house price index but also includes other aspects which are related directly or indirectly to the house price index. From the finding, it is learnt that by using filtering technique, better prediction in house price modeling was obtained

    Investigating teacher's integrity through association rule mining

    Get PDF
    The selection of teachers to attend trainings is currently done randomly, by rotation and not based on their work performance.This poses a problem in selecting the right teacher to attend the right course.Up until now, there is no intelligent model to assist the school management to determine the integrity level of teacher and assign them to the right training program.Thus, this study investigates the integrity traits of teacher using association rule technique with an aim, which can assist the school management to organize a training related to teachers’ integrity performance and to avoid sending the wrong teacher for the training.A dataset of Trainees Integrity Dataset representing 1500 secondary school teachers in Langkawi Island, Malaysia in the year 2009 were pre-processed and mined using apriori. Mining knowledge was analyzed based on demographic and integrity trait of teacher.The finding indicates that adaptability and stability are the weakest integrity trait among teachers.Teachers from the age group of 26 - 30 years are found to have lower integrity performance.However, other demographic factor such as gender, race, and grade position of teachers were not able to reflect their low integrity level in this study.Finally, this study produces a component of trainee selection module which can be used as guideline for school management to propose suitable training programs for teacher to improve their integrity mainly on adaptability and stability traits

    Semantic network representation of female related issues from the Holy Quran

    Get PDF
    Quran is the main source of knowledge and has been a major source reference for all types of problems.Female is one of terms that are very popular in the Quran.This term represents a group of human that are venerable in the aspect of social and family.Quran contains many important issues related to the female. However understanding the issues and the solution from the Quran is difficult due to lack of understanding of Quran literature.Furthermore, female has been addressed in the Quran through several other terms such as women,lady and girls depending on the issues that being addressed.In this paper, 16 terms that are related to female and its related surah and verse have been identified.Semantic network is used to represent the issues. This study also found that, semantic network representation provide a clear and brief overview of the issue that is easy to understand and comprehend

    Ant colony optimization algorithm for load balancing in grid computing

    Get PDF
    Managing resources in grid computing system is complicated due to the distributed and heterogeneous nature of the resources. This research proposes an enhancement of the ant colony optimization algorithm that caters for dynamic scheduling and load balancing in the grid computing system. The proposed algorithm is known as the enhance ant colony optimization (EACO). The algorithm consists of three new mechanisms that organize the work of an ant colony i.e. initial pheromone value mechanism, resource selection mechanism and pheromone update mechanism. The resource allocation problem is modelled as a graph that can be used by the ant to deliver its pheromone.This graph consists of four types of vertices which are job, requirement, resource and capacity that are used in constructing the grid resource management element. The proposed EACO algorithm takes into consideration the capacity of resources and the characteristics of jobs in determining the best resource to process a job. EACO selects the resources based on the pheromone value on each resource which is recorded in a matrix form. The initial pheromone value of each resource for each job is calculated based on the estimated transmission time and execution time of a given job.Resources with high pheromone value are selected to process the submitted jobs. Global pheromone update is performed after the completion of processing the jobs in order to reduce the pheromone value of resources.A simulation environment was developed using Java programming to test the performance of the proposed EACO algorithm against other ant based algorithm, in terms of resource utilization. Experimental results show that EACO produced better grid resource management solution

    Resource management in grid computing using ant colony optimization

    Get PDF
    Managing resources in grid computing system is complicated due to the distributed and heterogeneous nature of the resources.Stagnation in grid computing system may occur when all jobs require or are assigned to the same resources which lead to the resources having high workload or the time taken to process a job is high.This research proposes an Enhanced Ant Colony Optimization (EACO) algorithm that caters dynamic scheduling and load balancing in the grid computing system.The algorithm consists of three new mechanisms that organize the work of an ant colony i.e. initial pheromone value mechanism, resource selection mechanism and pheromone update mechanism.The resource allocation problem is modeled as a graph that can be used by the ant to deliver its pheromone.This graph consists of four types of vertices which are job, requirement, resource and capacity that are used in constructing the grid resource management element.The proposed EACO algorithm takes into consideration the capacity of resources and the characteristics of jobs in determining the best resource to process a job.EACO selects the resources based on the pheromone value on each resource which is recorded in a matrix form.The initial pheromone value of each resource for each job is calculated based on the estimated transmission time and execution time of a given job. Resources with high pheromone value are selected to process the submitted jobs.Global pheromone update is performed after completion processing the jobs in order to reduce the pheromone value of resources.A simulation environment was developed using Java programming to test the performance of the proposed EACO algorithm against other ant based algorithm, in terms of resource utilization.Experimental results show that EACO produced better grid resource management solution

    Weakest integrity traits identification of teachers using association rule mining

    Get PDF
    The government has arranged many programs for teacher development however the training is organized to fit yearly calendar without considering the right teacher for the right training.The selection of teacher to attend training is done randomly, by rotation and not based on their work performance.This paper investigate the weakest integrity trait of teacher using association rule technique with the aim can assists the school management to organize training related to teachers integrity performance and avoid sending a wrong teacher for a training.A dataset of Trainees Integrity Dataset (TID) representing 1500 secondary school teachers in Langkawi Island, Malaysia in the year 2009 were pre-processed and mined using apriori. The knowledge from the mining was analyzed based on demographic and integrity trait of teacher.The finding indicates that adaptability and stability are the weakest integrity trait among teachers.Besides that, the analysis also unable to prove that demographic factor such as the age and gender of teachers reflect their low integrity performance.The finding can be a guideline for school management to propose a suitable training program for teacher to improve integrity mainly at the adaptability and stability trait

    Fuzzy distance-based undersampling technique for imbalanced flood data

    Get PDF
    Performances of classifiers are affected by imbalanced data because instances in the minority class are often ignored. Imbalanced data often occur in many application domains including flood. If flood cases are misclassified, the impact of flood is higher than the misclassification of non-flood cases.Numerous resampling techniques such as undersampling and oversampling have been used to overcome the problem of misclassification of imbalanced data.However, the undersampling and oversampling techniques suffer from elimination of relevant data and overfitting, which may lead to poor classification results.This paper proposes a Fuzzy Distance-based Undersampling (FDUS) technique to increase classification accuracy. Entropy estimation is used to generate fuzzy thresholds which are used to categorise the instances in majority and minority classes into membership functions. The performance of FDUS was compared with three techniques based on Fmeasure and G-mean, experimented on flood data. From the results, FDUS achieved better F-measure and G-mean compared to the other techniques which showed that the FDUS was able to reduce the elimination of relevant data
    • …
    corecore